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(54) Title: NOVEL MAMMALIAN NUCLEAR RECEPTOR L66 AND METHODS OF USE 

® (57) Abstract: The present invention relates to a novel nuclear receptor called "L66" or also FXR-0 a homologue of the FXR-a, a 
O prototypical type 2 nuclear receptor. The invention also relates to the isolated nucleic acid sequence of L66 and the isolated protein 
^ thereof. The invention further relates to processes for isolating and/or producing the nucleic acid or the protein as well as methods 
^ of use of the receptor L66. 
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Title: 



NOVEL MAMMALIAN NUCLEAR RECEPTOR L66 AND METHODS OF USE 



20 BACKGROUND OF THE INVENTION 

Multicellular organisms are dependent on advanced mechanisms of information 
transfer between cells and body compartments. The information that is transmitted 
can be highly complex and can result in the alteration of genetic programs involved in 
cellular differentiation, proliferation, or reproduction. The signals, or hormones, are 
often simple molecules, such as peptides, fatty acid, or cholesterol derivatives, 
s 

Many of these signals produce their effects by ultimately changing the transcription of 
specific genes. One well-studied group of proteins that mediate a cell's response to a 
30 variety of signals is the family of transcription factors known as nuclear receptors, 
hereinafter referred to often as "NR". Members of this group include receptors for 
steroid hormones, vitamin D, ecdysone, cis and trans retinoic acid, thyroid hormone, 
bile acids, cholesterol-derivatives, fatty acids (and other peroxisomal proliferators), as 
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well as so-called orphan receptors, proteins that are structurally similar to other 
members of this group, but for which no llgands are known (Escriva, H. et al M Ligand 
binding was acquired during evolution of nuclear receptors, PNAS, 94, 6803 - 6808, 
1997). Orphan receptors may be indicative of unknown signaling pathways in the cell 
or may be nuclear receptors that function without ligand activation. The activation of 
transcription by some of these orphan receptors may occur in the absence of an 
exogenous ligand and/or through signal transduction pathways originating from the 
cell surface (Mangelsdorf, D. J. et al., The nuclear receptor superfamily: the second 
decade, Cell 83, 835-839, 1995). 

10 

In general, three functional domains have been defined in NRs. An amino terminal 
domain is believed to have some regulatory function. A DNA-binding domain 
hereinafter referred to as "DBD" usually comprises two zinc finger elements and 
recognizes a specific Hormone Responsive Element hereinafter referred to as "HRE" 
within the promoters of responsive genes. Specific amino acid residues in the "DBD" 
have been shown to confer DNA sequence binding specificity (Schena, M. & 
Yamamoto, K.R., Mammalian Glucocorticoid Receptor Derivatives Enhance 
, ™ x ( !; 

hereinafter referred to as "LBD" is at the carboxy-terminal region of known NRs. In t 
20 the absence of hormone, the LBD appears to interfere with the interaction of the DBD 
with its HRE. Hormone binding seems to result in a conformational change in the NR 
and thus opens this interference (Brzozowski et aL, Molecular basis of agonism and 
antagonism in the oestogen receptor, Nature, 389, 753 - 758, 1997; Wagner et al., A 
structural role for hormone in the thyroid hormone receptor, Nature, 378, 690 - 697. 
1995). A NR without the HBD constitutively activates transcription but at a low level. 

Both the amino-terminal domain and the LBD appear to have transcription activation 
functions hereinafter referred to as "TAF". Acidic residues in the amino-terminal 
domains of some nuclear receptors may be important for these transcription factors 
30 to interact with RNA polymerase. TAF activity may be dependent on interactions with 
other protein factors or nuclear components (Diamond et al., Transcription Factor 
Interactions: Selectors of Positive or Negative Regulation from a Single DNA 
Element, Science, 249:1266-1272 , 1990). Certain oncoproteins (e.g., c-Jun and c- 
Fos) can show synergistic or antagonistic activity with glucocorticoid receptors (GR) 
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in transfected cells. Furthermore, the receptors for estrogen and vitamins A and D, 
and fatty acids have been shown to interact, either physically or functionally, with the 
Jun and Fos components of AP-1 in the transactivation of steroid- or AP-1 regulated 
genes. 

Coactivators or transcriptional activators are proposed to bridge between sequence 
specific transcription factors, the basal transcription machinery and in addition to 
influence the chromatin structure of a target cell. Several proteins like SROl, ACTR, 
and Gripl interact with NRs in a ligand enhanced manner (Heery et a!., A signature 
10 motif in transcriptional coactivators mediates binding to nuclear receptors, Nature, 
387, 733 - 736; Heinzel et al., A complex containing N-CoR, mSin3 and histone 
deacetylase mediates transcriptional repression, Nature 387, 43-47, 1997). 
Furthermore, the physical interaction with negative receptor-interacting proteins or 
corepressors has been demonstrated (Xu et al., Coactivator and Corepressor 
complexes in nuclear receptor function, Curr Opin Genet Dev, 9 (2), 140 - 147, 
1999). 

Nuclear receptor modulators like steroid hormones affect the growth and function of 
specific cells by binding to intracellular receptors and forming nuclear receptor-ligand 
20 complexes. Nuclear receptor-hormone complexes then interact with a hormone 
response element (HRE) in. the control region of specific genes and alter specific 
gene expression. 

Over the past decade, new members of the nuclear hormone gene family have been 
identified that lack known ligands. These orphan receptors can be used to uncover 
-signaling molecules that regulate yet unidentified physiological networks. Some of 
these orphan receptors are constitutively active in transactivate target genes without 
the need to interact with a ligand (Mangelsdorfet al., 1995). 

30 Farnesoid X Receptor alpha (hereinafter FXR-a) is a prototypical type 2 nuclear receptor {US Pat. 6,005,086) 
which activates genes upon binding to promoter region of target genes in a heterodimeric fashion with Retinoid 
X Receptor (hereinafter FOCR, Forman etal., Cell, 81, 687-93, 1995).. The relevant 
physiological ligands of FXR-a seem to be bile acids. The most potent is chenodeoxycholic acid, which 
regulates the expression of several genes that participate in bile acid homeostisis. Farnesoid, originally 
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described to activate the rat ortholog at high concentration does not activate the human or mouse receptor. It 
is highly expressed in the liver, intestine and kidney. Like LXR-a FXR-a is involved in intracrine signaling. 

The relevant physiological ligands of NR1H4 (as FXR-a is also called) seem to be 
bile acids (Makishima et al., Science, 284, 1362-65, 1999; Parks et aL, Science, 284, 
1365-68, 1999). The most potent is chenodeoxycholic acid, which regulates the 
expression of several genes that participate in bile acid homeostasis. 

Consequently, FXR-a is proposed to be a nuclear bile acid sensor. As a result, it modulates both, the synthetic 
1 0 output of bile acids in the fiver and their recycling in the mtestim (by regulating bile acid binding protons). Its 
is also activated by retoinic acid and TTNPB at supraphisiological concentration. Furthermore, it regulates the 
conversion of dietry cholesterol into bile acids by regulation the metabolizing genes like CYP7-a This is a feed 
back regulation since the receptor is activated by bile acids. 

Through its regulatory function in cholesterol and bile acid metabolism an FXR-a homologue could serve as a 
target for cholesterol lowering drugs and exert beneficial effects in diseases like artheriosclerosis and other 
metabolic disorders. 



It was thus an object of the present invention to provide for a novel nuclear receptor. 
In a preferred embodiment of the invention it was an object to provide for a 
homologue of FXR-a. It was an object of the present invention to provide for means 
of producing this receptor as well as means of screening for agonists and antagonists 
to the receptor. Further objects of the invention are outlined below. 

-SUMMARY OF THE INVENTION 

The present invention provides, inter alia, a novel nuclear receptor protein. In a 
preferred embodiment of the invention a novel FXR-a homologue is provided for. 
Also provided is the nucleic acid sequence encoding this novel nuclear receptor 
30 protein, as well as compounds and methods for using this protein and its nucleic acid 
sequence. 
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The present invention provides a novel proteins, nucleic acids, and methods useful 
for developing and identifying compounds for the treatment of such diseases and 
disorders as metabolic disorders, immunological indications, hormonal dysfunctions, 
neurosystemic diseases and in preferred embodiments, high cholesterol and artheriosclerosis 
as well as other metabolic disorders. 

Identified and disclosed herein is the protein sequence for a novel nuclear receptor 
and the nucleic acid sequence encoding this nuclear receptor L66, which we call the 
L66 nuclear receptor (or simply R L66 W ) receptor, or also FXR-p. 

The importance of this discovery is manifested in the effects of FXR-p to modulate 
genes involved in cellular functions like regulation of metabolism and ceil 
homeostasis, cell proliferation and differentiation, pathological cellular aberrations, or 
cellular defense mechanisms including tumor development, Le. cancer. 

Thus, this L66 protein is useful for screening for L66 agonists and antagonist activity 
for controlling these conditions. 



In one aspect of the present invention, we provide isolated nucleic acid sequences 
for a novel receptor, the L66 receptor. In particular, we provide the cDNA sequences, 
protein sequences as well as the genomic sequences encoding the human L66 
receptor, as well as the cDNA sequence, protein sequence and genomic sequence of 
the Mus musculus (mouse) receptor. 

These nucleic acid sequences have a variety of uses. For example, they are useful 
for making vectors and for transforming cells, both of which are ultimately useful for 
production of the L66 protein. 

They are also useful as scientific research tools for developing nucleic acid probes 
30 for determining L66 expression levels, e.g., to identify diseased or otherwise 

abnormal states. They are useful for developing analytical tools such as anti sense 
oligonucleotides for selectively inhibiting expression of the L66 gene to det rmine 
physiological responses. 
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In another aspect of the present invention, we provide a homogenous composition 
comprising the L66 protein. The protein is useful for screening drugs for agonist and 
antagonist activity, and, therefore, for screening for drugs useful in regulating 
physiological responses associated with L66. Specifically, antagonists to the L66 
receptor could be used to treat diseases and disorders as metabolic disorders, 
immunological indications, hormonal dysfunctions, neurosystemic diseases and in 
preferred embodiments, high cholesterol and arteriosclerosis as well as other metabolic disorders, 
whereas agonists could be used for the treatment of these conditions. The proteins 
are also useful for developing antibodies for detection of the protein. 

Flowing from the foregoing are a number of other aspects of the invention, including 

(a) vectors, such as plasmids, comprising the L66 nuclear receptor nucleic acid 
sequence that may further comprise additional regulatory elements, e.g., promoters, 

(b) transformed cells that express the L66, (c) nucleic acid probes, (d) antisense 
oligonucleotides, (e) agonists, (f) antagonists, and (g) transgenic mammals. Further 
aspects of the invention comprise methods for making and using the foregoing 
compounds and compositions. 



The foregoing merely summarizes certain aspects of the present invention and is not 
20 intended, nor should it be construed, to limit the invention in any manner. All patents 
and other publications recited herein are hereby incorporated by reference in their 
entirety. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

THE L66 PROTEIN AND NUCLEIC ACID: 

The present invention comprises a novel member of the nuclear receptor superfamily 
which the inventors herein refer to as L66. Particularly preferred embodiments of the 
L66 receptor are those having an amino acid sequence substantially the same as 
30 SEQ ID NO. 3, 24 and/or 19. Examination of the amino acid sequence confirms that 
the present protein is indeed a member of the nuclear receptor family (see also US 
6,005,086) which is closely related to FXR (see also figure 3 for the domain 
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composition). The carboxy-terminal ligand binding domain "LBD" of L66 is a complex 
region encoding subdomains for ligand binding, often dimerization and transcriptional 
activation. 

The nucleic acids claimed above may be present in various forms, i.e. as an RNA, 
DNA, cDNA or as genomic DNA. 

As used herein, if reference to L66, the L66 receptor, the nuclear receptor L66 or the 
L66 nuclear receptor is made it is meant as a reference to any protein having an 
amino acid sequence substantially the same as SEQ ID NO.: 3, 24 and/or 19. 
The present invention also comprises nucleic acid sequences encoding the L66 
receptor, which nucleic acid sequences are substantially the same as SEQ ID NO. 
land SEQ ID NO. 17 (cDNAs) or splice variants thereof 4, 6, 8, 10, 12, 22, their 
sequence complements SEQ ID NO. 2 and 18 or complements of said splice variants 
5, 7, 9, 1 1 , 13, 23. SEQ ID NO 4encodes the human cDNA L66 receptor and is a 
preferred embodiment. SEQ OD NO. 17 (cDNA) encodes the mouse receptor (Mus 
musculus). SEQ ID NO. 20 represents the genomic sequence of L66 locus from the 

sequence according to the invention comprises the sequence according to SEQ ID 
NO. 1, 4, 6, 8, 10, 12, 17, 20( 4 to 12 representing various splice variants of the 
human L66) and/or 22 or portions thereof, in a preferred embodiment nucleic acid 
sequence according to the invention consists of the sequence according to SEQ ID 
NO. 1, 4, 6, 8, 10 ( 4 to 10 representing various splice variants of the human L66) 
and/or 1 7 or portions thereof. 

Herein the "complement* refers to the complementary strand of the nucleic acid 
according to the invention, thus the strand that would hybridize to the nucleic acid 
according to the invention. In accordance with standard biological terminology all 
DNA sequences herein are however written in 5-3' orientation, thus the if a 
30 complement is mentioned (see also figures) it is actually a "reverse" complement (as 
also stated in the figures). For simplification purposes they may however sometimes 
be referred to simply as "complements". 
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As used herein, a protein "having an amino acid sequence substantially the same or 
similar as SEQ ID NO x" (where "x" is the number of one of the protein sequences 
recited in the Sequence Listing) means a protein whose amino acid sequence is the 
same as SEQ ID NO x or differs only in a way such that at least 50% of the residues 
compared in a sequence alignment with SEQ ID NO. x are identical, preferably 75% 
of the residues are identical, even more preferably 95% of the residues are identical 
and most preferably at least 98% of the residues are identical. 

Those skilled in the art will appreciate that conservative substitutions of amino acids 
can be made without significantly diminishing the protein's affinity for interacting 
proteins, DNA binding sites, L66 receptor modulators, e.g. small molecular 
hydrophobic compounds, or RNA. 

Other substitutions may be made that increase the protein's affinity for these 
compounds. Making and identifying such proteins is a routine matter given the 
teachings herein, and can be accomplished, for example, by altering the nucleic acid 
sequence encoding the protein (as disclosed herein), inserting it into a vector, 

*J!Iiy?' e ' c ac ' c ' sec l uence » anc l measuring the binding 
affinity of the resulting protein, all as taught herein. 

As used herein the term n a molecule having a nucleotide sequence substantially the 
same as SEQ ID NO /' (wherein y is the number of one of the protein-encoding 
nucleotide sequences listed in the Sequence Listing) means a nucleic add encoding 
a protein "having an amino acid sequence substantially the same as SEQ ID NO y+l" 
(wherein "y+P is the number of the amino acid sequence for which nucleotide 
- sequence y codes) as defined above. This definition is intended to encompass 
natural allelic variations in the L66 sequence. Cloned nucleic acid provided by the 
present invention may encode L66 protein of any species of origin, including (but not 
limited to), for example, mouse, rat, rabbit, cat, dog, primate, and human. 

Preferably, the nucleic acid provided by the invention encodes L66 receptors of 
mammalian, preferably mouse and most preferably human origin. 
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Preferably, the L66 receptors proteins provided by the invention are of mammalian, 
more preferably mouse and most preferably human origin. 

The inventors have found (see figures and examples) that the L66 gene in humans is 
expressed primarily in testis. In mice however expression may be observed also in 
other tissues. 



IDENTIFICATION OF VARIANTS AND HOMOLOGUES AS WELL AS USE OF 
10 PROBES: 

Nucleic acid hybridization probes provided by the invention are nucleic acids 
consisting essentially of the nucleotide sequences complementary to any sequence 
depicted in SEQ ID NO. 1, 4, 6, 8, 10, 12, 17, 20, 22, 2, 5, 7, 9, 11, 13, 18, 21 and/or 
23 or a part thereof and that are effective in nucleic acid hybridization 

Nucleic acid hybridization probes provided by the invention are nucleic acids capable 
of detecting i.e. hybridizingl^the gene^^ 
ID NO. 3, 24 and/or 19. 

20 

Nucleic acid probes are useful for detecting L66 gene expression in cells and tissues 
using techniques well-known in the art, including, but not limited to, Northern blot 
hybridization, in situ hybridization, and Southern hybridization to reverse 
transcriptase - polymerase chain reaction product DNAs. The probes provided by the 
_ present invention, including oligonucleotide probes derived therefrom, are also useful 
for Southern hybridization of mammalian, preferably human, genomic DNA for 
screening for restriction fragment length polymorphism (RFLP) associated with 
certain genetic disorders. As used herein, the term complementary means a nucleic 
acid having a sequence that is sufficiently complementary in the Watson-Crick sense 
30 to a target nucleic acid to bind to the target under physiological conditions or 

experimental conditions those skilled in the art routinely use when employing probes. 

It is understood in the art that a nucleic acid sequence will hybridize with a 
complementary nucleic acid sequence under high stringent conditions as defined 
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herein, even though some mismatches may be present Such closely matched, but 
not perfectly complementary sequences are also encompassed by the present 
invention. For example, differences may occur through genetic code degeneracy, or 
by naturally occurring or man made mutations and such mismatched sequences 
would still be encompassed by the present claimed invention. 

Preferably, the nucleotide sequence of the nuclear receptor L66 (SEQ ID NO. 1 f 4, 
17, and/or 22 or splice variants thereof) and/or their complements can be used to 
derive oligonucleotide fragments (probes) of various length. If the probe is used to 

10 detect a human L66 sequence most preferably a complement of SEQ ID NO. 1 
and/or 4 or its complement is used. If the probe is supposed to detect a mouse or a 
rodent sequence probes complementary to SEQ ID NO. 17 and/or 22, or their 
respective complements are preferred. Stretches of 17 to 30 nucleotides are used 
frequently but depending on the screening parameters longer sequences as 40, 50, 
100, 150 up to the full length of the sequence may be used. Those probes can be 
synthesized chemically and are obtained readily from commercial oligonucleotide 
providers. Chemical synthesis has improved over tine years and chemical synthesis 
— « J9lEl!S^^ as ' on 9 as 100 " 2 °0 bases is possible. The field might advance 

further to allow chemical synthesis of even longer fragments. Alternatively, probes 

20 can also be obtained by biochemical de novo synthesis of single stranded DNA. In 
this case the nucleotide sequence of the nuclear receptor L66 or its complement (see 
figures) serve as a template and the corresponding complementary strand is 
synthesized. A variety of standard techniques such as nick translation or primer 
extension from specific primers or short random oligonucleotides can be used to 
synthesize the probe (Molecular Cloning: A Laboratory Manual (3 Volume Set) by 
- Joseph Sambrook, David W. Russell, Joe Sambrook, 2100 pages 3rd edition 
(January 15, 2001; . Molecular cloning: a laboratory manual. Cold Spring Harbor 
Press, Cold Spring Harbor, 1989)). Nucleic acid reproduction technologies 
exemplified by the polymerase chain reaction (Saiki, R.K. et a/. Primer-directed 

30 enzymatic amplification of DNA with a thermostable DNA polymerase. Science 239, 
487-491 (1988)) are commonly applied to synthesize probes. In the case of 
techniques using specific primers the nucleic acid sequence of the nuclear receptor 
L66 or its complement is not only used as a template in the biochemical reaction but 
also to derive the specific primers which are needed to prime the reaction. 
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In some cases one might also consider to use the nucleic acid sequence of the 
cofactor or its complement as a template to synthesize an RNA probe. A promoter 
sequence for a DNA-dependent RNA polymerase has to be introduced at the 5-end 
of sequence. As an example this can be done by cloning the sequence into a vector 
which carries the respective promoter sequence. It is also possible to introduce the 
needed sequence by synthesizing a primer with the needed promoter in the form of a 
5' "tail". The chemical synthesis of a RNA probe is another option. 

10 Appropriate means are available to detect the event of a hybridization. There is a 

wide variety of labels and detection systems, e.g. radioactive isotopes, fluorescent or 
chemiluminescent molecules which can be linked to the probe. Furthermore, there 
are methods of introducing haptens which can be detected by antibodies or other 
ligands such as the avidin/biotin high affinity binding system. 

Hybridization can take place in solution or on solid phase or in combinations of the 
two, e.g. hybridization in solution and subsequent capture of the hybridization product 

20 Hybridization probes act by forming selectively duplex molecules with complementary 
stretches of a sequence of a gene or a cDNA. The selectivity of the process can be 
controlled by varying the conditions of hybridization. To select sequences which are 
identical highly homologous to the sequence of interest stringent conditions for the 
hybridization will be used, e.g. low salt in the range of 0.02 M to 0.15 M salt and/or 
high temperatures in the range from 50°C degrees centigrade to 70°C degrees 
centigrade. Stringency can be farther improved by the addition of formamide to the 
hybridization solution. The use of stringent conditions which means that only little 
mismatch or a complete match will lead to a hybridization product would be used to 
isolate closely related members of the same gene family. Thus, as used herein 

30 stringent hybridization conditions are those where between 0.02 M to 0.15 M salt 
and/or high temperatures in the range from 50°C degrees centigrade to 70°C 
degrees centigrade are applied. 

I 
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The use of highly stringent conditions or conditions of "high stringency" means that 
only very little mismatch or a complete match which lead to a hybridization product 
would be used to isolate very closely related members of the same gene family. 
Thus, as used herein highly stringent hybridization conditions are those where 
between 0.02 - 0.3 M salt and 65°C degrees centigrade are applied for about 5 to 18 
hours of hybridization time and additionally, the sample filters are washed twice for 
about 15 minutes each at between 60°C - 65°C degrees centigrade, wherein the first 
washing fluid contains about 0.1 M salt (NaCI and/or Sodium Citrate) and the second 
contains only about 0.02 M salt (NaCI and/or Sodium Citrate). In a preferred 
1 0 embodiment the following conditions are considered to be highly stringent: 

Hybridisation in a buffer containing 2 x SSC (0.03 M Sodium Citrate, 0.3 M NaCI) at 
65°C - 68°C degrees centigrade for 12 hours, followed by a washing step for 15 
minutes in 0.5 x SSC, 0.1% SDS, and a washing step for 15 minutes at 65°C degrees 
centigrade in 0.1 x SSC, 0.1% SDS. 

Less stringent hybridization conditions, e.g. 0.15 M salt - 1 M salt and/or 
temperatures from 22°C degrees centigrade to 56°C degrees centigrade are applied 

20 orthologous sequences from other species. 

Unspecific hybridization products are removed by washing the reaction products 
repeatedly in 2 x SSC solution and increasing the temperature. 



DEGENERATE PCR AND CLONING OF HOMOLOGIES 

The nucleotide sequence of the nuclear receptor L66 or its complement can be used 
to design primers for a polymerase chain reaction. Due to the degeneracy of the 
genetic code the respective amino acid sequence is used to design oligonucleotides 
30 in which varying bases coding for the same amino acid are included. Numerous 
design rules for degenerate primers have been published (Compton et al, 1990). As 
in hybridization there are a number of factors known to vary the stringency of the 
PCR. The most important parameter is the annealing temperature. To allow 
annealing of primers with imperfect matches annealing temperatures are often much 
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lower than the standard annealing temperature of 55°C, e.g. 35°C to 52°C degrees 
can be chosen. PGR reaction products can be cloned. Either th PCR product is 
cloned directly, with reagents and protocols from commercial manufacturers (e.g. 
from Invitrogen, San Diego, USA). Alternatively, restriction sites can be introduced 
intra the PCR product via a 5-tail of the PCR primers and used for cloning. Primers 
for the amplification of the entire or partial pieces of the L66 gene or mRNA, or for 
reverse transcription may be designed making use of the sequences according to the 
invention, i.e. those depicted in the figures below. 

10 GENETIC VARIANTS 

Fragments from the nucleotide sequence of the nuclear receptor L66 (SEQ ID NO. 1, 
4 or the mouse, ,i.e. 1 7 or 22) or their complements can be used to cover the whole 
sequence with overlapping sets of PCR primers. Also the genomic sequences may 
be used (see figures for sequences). These primers are used to produce PCR 
products using genomic DNA from a human diversity panel of healthy individuals or 
genomic DNA from individuals which are phenotypically conspicuous. Also the 
"^e 

applicant (deposit number DSM 14483 ) or that of the mouse according to SEQ ID 
20 NO. 20 (or the complement thereof). The PCR products can be screened for 

polymorphisms, for example by denaturing gradient gel electrophoresis, binding to 
proteins detecting mismatches or cleaving heterodupiices or by denaturing high- 
performance liquid chromatography. Products which display mutations need to be 
sequenced to identify the nature of the mutation. Alternatively, PCR products can be 
sequenced directly omitting the mutation screening step to identify genetic 
polymorphisms. If genetic variants are identified and are associated with a discrete 
phenotype, these genetic variations can be included in diagnostic assays. The 
normal variation of the human population is of interest in designing screening assays 
as some variants might interact better or worse with a respective lead substance (a 
30 pharmacodynamic application). Polymorphisms or mutations which can be correlated 
to phenotypic outcome are a tool to extend the knowledge and the commercial 
applicability of the nucleotide sequence of the nuclear receptor L66 or its 
complement or their gene product, as variants might have a slightly different 
molecular behavior or desired properties. Disease-causing mutations or 
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polymorphisms allow the replacement of this disease inducing gene copy with a wild- 
type copy by means of gene therapy approaches and/or the modulation of th activity 
of the gene product by drugs. 



PREPARATION OF POLYNUCLEOTIDES: 

DNA which encodes receptor L66 may be obtained, in view of the instant disclosure, 
by chemical synthesis, by screening reverse transcripts of mRNA from appropriate 
cells or ceil line cultures, by screening genomic libraries from appropriate cells, or by 
1 0 combinations of these procedures, as illustrated below. 

Screening of mRNA or genomic DNA may be carried out with oligonucleotide probes 
generated from the L66 nucleotide sequences information provided herein. 

Probes may be labeled with a detectable group such as a fluorescent group, a 
radioactive atom or a chemiluminescent group in accordance with known procedures 
and used in conventional hybridization assays, as described in greater detail in the 
Examples below. Alternatively, the L66 nucleotide sequence may be obtainedby use" 
of the polymerase chain reaction (PCR) procedure, with the PCR oligonucleotide 
primers being produced from the L66 nucleotide sequences provided herein. 

Upon purification or synthesis, the nucleic acid according to the invention may be 
labeled, e.g. for use as a probe. 

-As single and differential labeling agents and methods, any agents and methods 
which are known in the art can be used. For example, single and differential labels 
may consist of the group comprising enzymes such as p-galactosidase, alkaline 
phosphatase and peroxidase, enzyme substrates, coenzymes, dyes, chromophores, 
fluorescent, chemiluminescent and bioluminescent labels such as FITC, Cy5, Cy5.5, 
30 Cy7, Texas-Red and lRD40(Chen et al. (1993), J. Ghromatog. A 652: 355-360 and 
Kambara et al. (1992), Electrophoresis 13: 542-546), ligands or haptens such as 
biotin, and radioactive isotopes such as 3 H, ^S, ^P 125 l and 14 C. 
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EXPRESSION OF THE L66 PROTEIN/POLYPET1DE: 

The nuclear receptor L66 nucleic acid or polypeptide may be synthesized in host 
cells transformed with a recombinant expression construct comprising a nucleic acid 
encoding the nuclear receptor L66. 

Such a recombinant expression construct can also be comprised of a vector that is a 
replicable DNA construct 

10 Amplification vectors do not require expression control domains. All that is needed is 
the ability to replicate in a host, usually conferred by an origin of replication, and a 
selection gene to facilitate recognition of transformants. See, Sambrook et al., 
Molecular Cloning: A Laboratory Manual (2nd Edition, Cold Spring Harbor Press, 
New York, 1989). 

An expression vector comprises a polynucleotide operatively linked to a prokaryotic 
promoter. Alternatively, an expression vector is a polynucleotide operatively linked to 

further has a polyadenylation signal that is positioned 3' of the carboxy-terminat 
amino acid and within a transcriptional unit of the encoded polypeptide. A promoter is 
a region of a DNA molecule typically within about 500 nucleotide pairs in front of 
(upstream of) the point at which transcription begins (/.e., a transcription start site). In 
general, a vector contains a replicon and control sequences which are derived from 
species compatible with the host cell. The vector ordinarily carries a replication site, 
as well as marking sequences which are capable of providing phenotypic selection in 
transformed cells. 

Another type of discrete transcription regulatory sequence element is an enhancer. 
An enhancer provides specificity of time, location and expression level for a particular 
30 encoding region (e.g., gene). A major function of an enhancer is to increase the level 
of transcription of a coding sequence in a cell. 
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As used herein, the phrase "enhancer-promoter* means a composite unit that 
contains both enhancer and promoter elements. An enhancer-promoter is operatively 
linked to a coding sequence that encodes at least one gene product. 

An enhancer-promoter used in a vector construct of the present invention may be 
any enhancer-promoter that drives expression in a prokaryotic or eukaryotic cell to be 
transformed/transfected. 

A coding sequence of an expression vector is operatively linked to a transcription 
10 terminating region. RNA polymerase transcribes an encoding DNA sequence through 
a site where polyadenylation occurs. 

An expression vector comprises a polynucleotide that encodes a nuclear receptor 
L66 polypeptide. Such a polynucleotide is meant to include a sequence of nucleotide 
bases encoding a nuclear receptor L66 polypeptide sufficient in length to distinguish 
said segment from a polynucleotide segment encoding a non- nuclear receptor L66 
polypeptide. 

A polypeptide of the invention ma^ 
20 peptides which have variant amino acid sequences, such as with changes selected 
based on considerations such as the relative hydropathic score of the amino acids 
being exchanged. 

These variant sequences are those isolated from natural sources or induced in the 
sequences disclosed herein using a mutagenic procedure such as site-directed 
-mutagenesis. 

Furthermore, an expression vector of the present invention may contain regulatory 
elements for optimized translation of the polypeptide in prokaryotic or eukaryotic 
30 systems. This sequences are operatively located around the transcription start site 
and are most likely similar to ribosome recognition sites like prokaryotic ribosome 
binding sites (RBS) or eukaryotic Kozak sequences as known in the art (Kozak M., 
Initiation of translation in prokaryotes and eukaryotes. Gene 234, 187-208 (1999). 
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An expression vector of the present invention is useful both as a means for preparing 
quantities of the nuclear receptor L66 polypeptide-encoding DNA itself, and as a 
means for preparing the encoded nuclear receptor L66 polypeptide and peptides. It is 
contemplated that where nuclear receptor L66 polypeptides of the invention are 
made by recombinant means, one may employ either prokaryotic or eukaryotic 
expression vectors as shuttle systems. 

Where expression of recombinant nuclear receptor L66 polypeptides is desired and a 
eukaryotic host is contemplated, it is most desirable to employ a vector such as a 
1 0 plasmid, that incorporates a eukaryotic origin of replication. Additionally, for the 
purposes of expression in eukaryotic systems, one desires to position the nuclear 
receptor L66 encoding sequence or if desired parts thereof (SEQ ID NO. 1 , 4, 6, 8, 
10, 12, 17, and/or 22) adjacent to and under the control of an effective eukaryotic 
promoter. To bring a coding sequence under control of a promoter, whether it is 
eukaryotic or prokaryotic, what is generally needed is to position the & end of the 
translation initiation site of the proper translational reading frame of the polypeptide 
between about 1 and about 2000 nucleotides 3 1 of or downstream with respect to the 

20 Furthermore, where eukaryotic expression is anticipated, one would typically desire 
to incorporate into the transcriptional unit which includes the nuclear receptor L66 
polypeptide, an appropriate polyadenylation side. 

The invention provides homogeneous compositions of mammalian nuclear receptor 
L66 polypeptide produced by transformed prokaryotic or eukaryotic cells as provided 
herein. Such homogeneous compositions are intended to be comprised of 
mammalian nuclear receptor L66 protein that comprises at least 90% of the protein in 
such homogenous composition. The invention also provides membrane preparation 
from cells expressing mammalian nuclear receptor L66 polypeptide as the result of 
30 transformation with a recombinant expression construct, as described here. 

Within the scope of the present invention the terms recombinant protein or coding 
sequence both also include tagged versions of the protein depicted in SEQ ID NO. 3, 
1 9 and/or 22, and/or encoded by the nucleic acids according to the invention and 
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fusion proteins of said proteins or parts thereof such as splice variants with any other 
recombinant protein. Tagged versions here means that small epitopes of 3-20 amino 
acids are added to the original protein by extending the coding sequence either at 
the 5'orthe 3 'terminus leading to N-terminal or Oterminal extended proteins 
respectively, or that such small epitopes are included elsewhere in the protein. The 
same applies for fusion proteins where the added sequences are coding for longer 
proteins, varying between 2 and 100 kDa. Tags and fusion proteins are usually used 
to facilitate purification of recombinant proteins by specific antibodies or affinity 
matrices or to increase solubility of recombinant proteins within the expression host. 
10 Fusion proteins are also of major use as essential parts of yeast two hybrid screens 
for interaction partners of recombinant proteins. 

Tags used in the scope of the present invention may include but are not limited to the 
following: EEF (alpha Tubulin), B-tag (QYPALT), E tag (GAPVPYPDPLEPR) c-myc 
Tag (EQKLISEEDL), Flag epitope (DYKDDDDK, HA tag (YPYDVPDYA), 6 or 10 x 
His Tag, HSV (QPELAPEDPED), Pk-Tag (GKPIPNPLLGLDST), protein C 
(EDQVDPRLIDGK), 17 (MASMTGGQQMG), VSV-G (YTDIEMNRLGK), Fusion 

Protein (MBP), Cellulose Binding protein (CBD), chitin binding protein, ubiquitin, the 
20 Fc part of Immunoglobulins, and the IgG binding domain of Staphylococcus aureus 
protein A. These examples of course are illustrative and not limiting and the standard 
amino acid one letter code was used above. 

For expression of recombinant proteins in living cells or organisms, vector constructs 
harboring recombinant L66 nuclear receptor as set forth in SEQ ID NO. 1 or 17 are 
~ transformed ortransfected into appropriate host cells. Preferably, a recombinant host 
cell of the present invention is transfected with a polynucleotide of SEQ ID NO. 1 , 4, 
22 or 17. 

30 Means of transforming or transfecting cells with exogenous polynucleotide such as 
DNA molecules are well known in the art and include techniques such as calcium- 
phosphate- or DEAE-dextran-mediated transfection, protoplast fusion, 
electroporation, liposome mediated transfection, direct microinjection and vims 
infection (Sambrook et al M 1989). 
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The most frequently applied technique for transformation of prokaryotic cells is 
transformation of bacterial cells after treatment with calciumchloride to increase 
permeability (Dagert & Ehrlich, 1979), but a variety of other methods is also available 
for one skilled in the art. 

The most widely used method for transfection of eukaryotic cells is transfeclion 
mediated by either calcium phosphate or DEAE-dextran. Although the mechanism 
remains obscure, ft is believed that the transfected DNA enters the cytoplasm of the 
10 cell by endocytosis and is transported to the nucleus. Depending on the cell type, up 
to 90% of a population of cultured cells may be transfected at any one time. Because 
of its high efficiency, transfection mediated by calcium phosphate or DEAE-dextran is 
the method of choice for studies requiring transient expression of the foreign nucleic 
acid in large numbers of cells. Calcium phosphate-mediated transfection is also used 
to establish cell lines that integrate copies of the foreign DNA, which are usually 
arranged in head-to-tail tandem arrays into the host cell genome. 

.4n4he^^!^ 

numbers of copies of a plasmid of interest are mixed directly with cultured 
mammalian cells. After fusion of the cell membranes (usually with polyethylene 
glycol), the contents of the bacterium are delivered into the cytoplasm of the 
mammalian cells and the plasmid DNA is transported to the nucleus. Protoplast 
fusion is not as efficient as transfection for many of the cell lines that are commonly 
used for transient expression assays, but it is useful for cell lines in which 
endocytosis of DNA occurs inefficiently. Protoplast fusion frequently yields multiple 
copies of the plasmid DNA tandemly integrated into the host chromosome. 

The application of brief, high-voltage electric pulses to a variety of mammalian and 
plant cells leads to the formation of nanometer sized pores in the plasma membrane. 
30 DNA is taken directly into the cell cytoplasm either through these pores or as a 
consequence of the redistribution of membrane components that accompanies 
closure of the pores. Electroporation may be extremely efficient and may be used 
both for transient expression of cloned genes and for establishment of cell lines that 
carry integrated copies of the gene of interest. Electroporation, in contrast to calcium 
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phosphate-mediated transfection and protoplast fusion, frequently gives rise to cell 
lines that carry one, or at most a few, integrated copies of the foreign DNA. 

Liposome transfection involves encapsulation of DNA and RNA within liposomes, 
followed by fusion of the liposomes with the cell membrane. The mechanism of how 
DNA is delivered into the cell is unclear but transfection efficiencies may be as high 
as 90%. 

Direct microinjection of a DNA molecule into nuclei has the advantage of not 
10 exposing DNA to cellular compartments such as Jow-pH endosomes. Microinjection 
is therefore used primarily as a method to establish lines of cells that carry integrated 
copies of the DNA of interest 

The use of adenovirus as a vector for cell transfection is well known in the art. 
Adenovirus vector-mediated cell transfection has been reported for various cells 
(Stratford-Perricaudet et al M 1 992). 

A transfected cell may be prokaryotic or eukaryotic, transfection may be transient or 
stable. Where it is of interest to producelTM^Iengtfrh^ 
20 cultured mammalian mouse, or human cells are of particular interest 

In another aspect, the recombinant host cells of the present invention are prokaryotic 
host cells. In addition to prokaryotes, eukaryotic microbes, such as yeast may also be 
used illustrative examples for suitable cells and organisms for expression of 
recombinant proteins are belonging to but not limited to the following examples: 
- Insect cells, such as Drosophila Sf21, SF9 ceils or others, Expression strains of 
Escherichia coli, such as XL1 blue, BRL21, M15, Saccharomyces cerevisiae, 
Schizosaccharomyces pombe, Hansenlua poiymorpha and Pichia pastoris strains, 
immortalized mammalian cell lines such as AtT-20, VERO and HeLa cells, Chinese 
30 hamster ovary (CHO) cell lines, and W138, BHK, COSM6, COS-7, 293 and MDCK 
cells, BHK-21 cells, Att 20HeLa cells, HeK 294, T47 D cells and others. 

Expression of recombinant proteins within the scope of this invention can also be 
performed in vitro. This may occur by a two step procedure, thereby producing first 
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mRNA by in vitro transcription of an apt polynucleotide construct followed by in vitro 
translation with convenient cellular extracts. These cellular extracts may be 
reticulocyte lysates but are not limited to this type. In vitro transcription may be 
performed by T7 or SP6 DNA polymerase or any other RNA polymerase which can 
recognize perse or with the help of accessory factors the promoter sequence 
contained in the recombinant DNA construct of choice. Alternatively one of the 
recently made available one step coupled transcription/translation systems may be 
used for in vitro translation of DNA coding for the proteins of this invention, e.g. from 
Roche Molecular Biochemicals. One illustrative but not limiting example for such a 
10 system is the TNT® T7 Quick System by Promega. 

Expression of recombinant proteins in transfected cell may occur constitutively or 
upon induction. Procedures depend on the Cell/vector combination used and are well 
known in the art. In all.cases, transfected cells are maintained for a period of time 
sufficient for expression of the recombinant L66 nuclear receptor protein. A suitable 
maintenance time depends strongly on the cell type and organism used and is easily 
ascertainable by one skilled in the art Typically, maintenance time is from about 2 

^~^4sQargiQL3b^ reasons and forsake of protein stability and 

solubility incubation temperatures during maintenance time may vary from 20°C to 42 

20 °C 

Recombinant proteins are recovered or collected either from the transfected cells or 
the medium in which those cells are cultured. Recovery comprises cell disruption, 
isolation and purification of the recombinant protein. Isolation and purification 
techniques for polypeptides are well-known in the art and include such procedures as 
precipitation, filtration, chromatography, electrophoresis and the like. 

In a preferred embodiment purification includes but is not limited to affinity 
purification of tagged or non-tagged recombinant proteins. This is a well established 
30 robust technique easily adapted to any tagged protein by one skilled in the art. For 
affinity purification of tagged proteins, small molecules such as gluthathione, maltose 
or chitin, specific proteins such as the IgG binding domain of Staphylococcus aureus 
protein A, antibodies or specific chelates which bind with high affinity to the tag of the 
recombinant protein are employed. For affinity purification of non-tagged proteins 
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specific monoclonal or polyclonal antibodies, which were raised against said protein, 
can be used. Alternatively immobilized specific interactors of said protein may be 
employed for affinity purification. Interactors include native or recombinant proteins 
as well as native or artificial specific low molecular weight ligands. 

CHEMICAL SYNTHESIS OF THE POLYPEPTIDE ACCORDING TO THE 
INVENTION: 

Alternatively, the protein itself may be produced using chemical methods to 
10 synthesize any of the amino acid sequences according to the invention or that is 
encoded by the nucleotide sequences according to the invention (SEQ ID NO. 1 , 4, 
17 or 22) and/or a portion thereof and/or splice variants thereof. For example, peptide 
synthesis can be performed using conventional Menifield solid phase f-Moc or t-Boc 
chemistry or various solid-phase techniques (Roberge, J. Y. et al. (1995) Science 
269: 202-204) and automated synthesis may be achieved, for example, using the ABl 
431 A Peptide Synthesizer (Perttn Elmer). The newly synthesized peptide(s) may be 

by preparative high performance liquid chromatography (e.g., 
Creighton, T. (1983) Proteins, Structures and Molecular PrinciplesTWH Freeirnan arSf 
Co., New York, N.Y.). The composition of the synthetic peptides may be confirmed 
20 by amino acid analysis or sequencing (e.g., the Edman degradation procedure; 

Creighton, supra). Additionally, the amino acid sequences according to the invention, 
i.e. SEQ ID NO. 3, 24 or 1 9 or the sequence that is encoded by SEQ ID NO. 1 , 4, 17 
or 22 or any part thereof, may be altered during direct synthesis and/or combined 
using chemical methods with sequences from other proteins, or any part thereof, to 
~" produce a variant polypeptide. 

SCREENING ASSAYS 

The invention also concerns a method for screening for agents which are capable of 
inhibiting the cellular function of the nuclear receptor L66 comprising the steps of 
30 contacting one or more candidate agents with a polypeptide according to the 

invention, removing unbound agent(s) and detecting whether the agent(s) interact 
with the polypeptid of the nuclear receptor. 
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The invention also concerns method for inhibiting the cellular function of the nuclear 
receptor L66, comprising the steps of contacting a cell with a binding agent of a 
polypeptide previously Identified as outlined herein whereby the cellular function of 
L66 is inhibited. 

Such a binding agent may be an antibody., RNA. an anti-sense oligonucleotide, a 
ribozyme or one of substances shown below or identified in a respective assay as 
disclosed herein. 

In still a further embodiment, the present invention concerns a method for identifying 
new nuclear receptor inhibitory or stimulatory substances, which may be termed as 
"candidate substances". It is contemplated that this screening technique proves 
useful in the general identification of compounds that serve the purpose of inhibiting 
or stimulating nuclear receptor activity. 

In one embodiment of the invention the following substances are disclosed as 
potential interactors of the nuclear receptor according to the invention: 



20 Steroids: dexamethasone-t-butyiacetate, RU486, progesterone, 17-alpha- 
hydroxypnogesterone, 1,16-alpha dimethylpregnenolone, 17-alpha- 
hydroxypregnenonlone, pregnenolone, 5beta-pregnane-3 f 20-dione, pregnenonlone- 
16-carbonitrile, 5beta-prbgnane-3,20-dione, androstanoi, corticosterone, 
dehydroepiandrosterone, dihydroxytestosterone, estradiol, Cortisol, cortisone, 
dihydroxytestosterone. 

Other substances: transnonach\or, chlordane, spironolactone, cyproterone acetate, 
rifampicin, nefipine, diethylstilbestrol, coumesterol, clotrimazole, lovastatin, 
phenoarbital, pthalic acid, nonylphenol, 1,4-bis(2-(3,5-dichloropyridyloxy1))benzene, 

30 

This also includes the use of heteromultimeric complexes of the nuclear receptor with 
other proteins, such as heterodimeric complexes with RXR, or any other binding 
partner. 
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Accordingly, in screening assays to identify pharmaceuticals agents which affect 
nuclear receptor activity, it is proposed that compounds isolated from natural 
sources, such as fungal extracts, plant extracts, bacterial extracts, higher eukaryotic 
cell extracts, or even extracts from animal sources, or marine, forest or soil samples, 
may be assayed for the presence of potentially useful pharmaceutical agents. 

It will be understood that that the pharmaceutical agents to be screened could also 
be derived from chemical compositions or man-made compounds. The candidate 
substances can could also include monoclonal or polyclonal antibodies, peptides or 
10 proteins, such as those derived from recombinant DNA technology or by other 
means, including chemical peptide synthesis. The active compounds may include 
fragments or parts or derivatives of naturally-occurring compounds or may be only 
found as active combinations of known compounds which are otherwise inactive. We 
anticipate that such screens will in some cases lead to the isolation of agonists of 
nuclear receptors, in other cases to the isolation of antagonists. In other instances, 
substances will be identified that have mixed agonistic and antagonistic effects, or 
affect nuclear receptors in any other way. 



CELL BASED ASSAYS 

20 

To identify a candidate substance capable of influencing L66 nuclear receptor 
activity, one first obtains a recombinant cell line. One designs the cell line in such a 
way that the activity of the nuclear receptor leads to the expression of a protein which 
has an easily detectable phenotype ( a reporter), such as luciferase, fluorescent 
proteins such as green or red fluorescent protein, beta-galactosidase, alpha- 
galactosidase, beta-lactamase, chloramphenicol-acetyl-transferase, beta- 
glucuronidase, or any protein which can be detected by a secondary reagent such as 
an antibody. 

30 Methods for detecting proteins using antibodies, such as ELISA assays, are well 
known to those skilled in the art. 

Here, the amount of reporter protein present reflects the transcriptional activity of the 
nuclear receptor. This recombinant cell line is then screened for the effect of 
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substances on the expression of the reporters, thus measuring the ffect of these 
substances on the activity of the nuclear receptor. These substances can be derived 
from natural sources, such as fungal extracts, plant extracts, bacterial extracts, 
higher eukaryotic cell extracts, or even extracts from animal sources, or marine, 
forest or soil samples, may be assayed for the presence of potentially useful 
pharmaceutical agents. It will be understood that that the pharmaceutical agents to 
be screened may be derived from chemical compositions or man-made compounds. 

The candidate substances can also include monoclonal or polyclonal antibodies, 
10 peptides or proteins, such as those derived from recombinant DNA technology or by 
other means, including chemical peptide synthesis. The active compounds may 
include fragments or parts or derivatives of naturally-occurring compounds or may be 
only found as active combinations of known compounds which are otherwise 
inactive. 

In general the assay comprises, contacting a suitable cell containing a reporter under 
the control of the L66 nuclear receptor with a test compound, monitoring said host 
cell for the expression of the reporter gene, wherein expression of the reporter 
reflects the transcriptional activity of the nuclear receptor L66, and therefore reflects 
effects of the compound on the nuclear receptor. 

In other embodiments of the invention assays are included where measuring the 
activity of a dimer of the nuclear receptor L66 and another protein, such as RXR 
takes place. Further included are assays aiming at the identification of compounds 
which specifically influence only the rnonomeric, homodimeric or homomultimeric 
- form of the nuclear receptor, or influencing only multimeric forms of the nuclear 
receptor. Such assays include measuring the effect of a compound on the nuclear 
receptor in the absence of a binding partner, and measuring the effect of a 
compound on the nuclear receptor in the presence of a binding partner, such as 
30 RXR. One skilled in the art will find numerous more assays which are equally 
covered by the invention. 

A cell line where the activity of a nuclear receptor determines the expression of a 
reporter can be obtained by creating a fusion gene driving the expression of a fusion 
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protein consisting of the ligand-binding domain of the L66 nuclear receptor fused to 
the DNA binding domain of a transcription factor with a known specificity for a given 
DNA sequence (the binding site). This DNA sequence (the binding site) can then be 
inserted in one or multiple copies before (5') to the promoter driving the expression of 
the reporter. Transcription factors useful for this approach include bacterial proteins, 
such as lexA, yeast proteins, such as Gal4, mammalian proteins such as NFkappaB 
or NFAT, the glucocorticoid receptor, the estrogen receptor, or plant proteins. The 
binding sites for these proteins can all be used in combination with the appropriate 
transcription factor to generate a useful reporter assay. 

Another way to screen for inhibitors is to identify binding sites on DNA for the L66 
nuclear receptor, and operatively link this binding site to a promoter operatively linked 
to a reporter gene. Included among others are binding sites for heterodimers of the 
L66 nuclear receptor with another protein, such as RXR. 

Furthermore, transgenic animals described in the invention can be used to derive ceil 
lines useful for cellular screening assays. 



Cell lines useful for such an assay include many different kinds of cells, including 
20 prokaryotic, animal, fungal, plant and human cells. Yeast cells can be used in this 
assay, including Saccharomyces cerevisiae and Schizosaccharvmyces pombe cells. 

Another way to build cellular assays to measure the effect of compounds is the use 
of the yeast two hybrid system (see for example see, for example, U.S. Pat. No. 
5,283,317; Zervos et al. (1993) Cell 72:223-232; Madura et al. (1993) J. Biol. Chem. 
268:12046-12054; Bartel et al. (1993) Biotechniques 14:920-924; Iwabuchi et al. 
(1993) Oncogene 8:1693-1696; PCT Publication No. WO 94/10300, and U.S. Pat 
No. 5,667,973), and or possible variants of the basic two hybrid system as discussed 
e.g in Vidal M, Legrain P, Nucleic Acids Res. 1999 Feb 15;27(4):919-29. Briefly, the 
30 two hybrid assay relies on reconstituting in vivo a functional transcriptional activator 
protein from two separate fusion proteins. In particular, the method makes use of 
chimeric genes which express hybrid proteins. To illustrate, a first hybrid gene 
comprises the coding sequence for a DNA-binding domain of a transcriptional 
activator fused in frame to the coding sequence for a Tl polypeptide. The second 
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hybrid protein encodes a transcriptional activation domain fused in frame to a sample 
gene from a cDNA library. If the bait and sample hybrid proteins are able to interact, 
e.g., form a Tl-dependent complex, they bring into close proximity the two domains of 
the transcriptional activator. This proximity is sufficient to cause transcription of a 
reporter gene which is operably linked to a transcriptional regulatory site responsive 
to the transcriptional activator, and expression of the reporter gene can be detected 
and used to score for the interaction of the Tl and sample proteins. 

In such assays, one primarily measures the effect of a compound on a given 
1 0 interaction involving the L66 nuclear receptor and a binding protein. In a preferred 
embodiment of the invention systems using other hosts such as prokaryotes as £. 
co// f or eukaryotic mammalian cells are described. 

Two hybrid systems using hybrid protein fusions with other proteins than transcription 
factors, including enzymes such as beta-galactosidase or dihydrofolate reductase 
may also be applied. These assays are useful both to monitor the effect of a 
compound, including peptides, proteins or nucleic acids on an interaction of a nuclear 
receptor with a given binding partner, as well as to identify novel proteins or nucleic 
addslntera^ 

Monitoring the influence of compounds on cells may be applied not only in basic drug 
screening, but also in clinical trials. In such clinical trials, the expression of a panel of 
genes may be used as a "read out" of a particular drug's therapeutic effect. 

CELL-FREE ASSAYS 

Recombinant forms of the polypeptide according to SEQ ID NO. 3, 24 or 19 or as 
encoded by the nucleic acids according to the invention can be used in cell-free 
screening assays aiming at the isolation of compounds affecting the activity of 
30 nuclear receptors. In such an assay, the nuclear receptor polypeptide is brought into 
contact with a substance to test if the substance has an effect on the activity of the 
L66 receptor. 
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The detection of an interaction between an agent and a receptor may be 
accomplished through techniques well-known in the art These techniques include 
but are not limited to centrifugation, chromatography, electrophoresis and 
spectroscopy. The use of isotopically labeled reagents In conjunction with these 
techniques or alone is also contemplated. Commonly used radioactive isotopes 
include 3 H f 14 C, . ^Na, ^P, ^P, ^S, ^Ca, ^Co, 125 l, and 131 l. Commonly used 
stable isotopes include * 2 H, . 13 C, 15 N, 18 0. 

For example, if an agent binds to the receptor of the present invention, the binding 
10 may be detected by using radiolabeled agent or radiolabeled receptor. Briefly, if 
radiolabeled agent or radiolabeled receptor is utilized, the agent-receptor complex 
may be detected by liquid scintillation or by exposure to x-ray film or phosho-imaging 
devices. 

One way to screen for substances affecting nuclear receptor activity is to measure 
the effect of the binding of nuclear receptors to ligands, such as cofactors, activators, 
repressors, DNA, RNA, proteins, antibodies, peptides or other substances, including 

of a protein to a ligand are well known in the art, such as ELISA assays, FRET 
20 assays, bandshift assays, plasmon-resonance based assays, scintilllation proximity 
assays, fluorescence polarization assays. 

In one example, a mixture containing the L66 polypeptide, effector and candidate 
substance is allowed to incubate. The unbound effector is separable from any 
effector/receptor complex so formed. One then simply measures the amount of each 
(e.g., versus a control to which no candidate substance has been added). This 
measurement may be made at various time points where velocity data is desired. 
From this, one determines the ability of the candidate substance to alter or modify the 
function of the receptor. 

30 

Numerous techniques are known for separating the effector from effector/receptor 
complex, and all such methods are intended to fall within the scope of the invention. 
This includes the use of thin layer chromatographic methods (TLC), HPLC, 
spectrophotometric, gas chromatographic/mass spectrophotometric or NMR 
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analyses. Another method of separation is to immobilize one of the binding partners 
on a solid support, and to wash away any unbound material. It is contemplated that 
any such technique may be employed so long as it is capable of differentiating 
between the effector and complex, and may be used to determine enzymatic function 
such as by identifying or quantifying the substrate and product. 

A screening assay provides a L66 receptor under conditions suitable for the binding 
of an agent to the L66 receptor. These conditions include but are not limited to pH, 
temperature, tonicity, the presence of relevant cofactors, and relevant modifications 
10 to the polypeptide such as glycosylation or lipidation. It is contemplated that the 
receptor can be expressed and utilized in a prokaryotic or eukaryotic cell. The host 
cell expressing the L66 receptor can be used whole or the receptor can be isolated 
from the host cell. The L66 receptor can be membrane bound in the membrane of the 
host cell or it can be free in the cytosol of the host cell. The host cell can also be 
fractionated into sub-cellular fractions where the receptor can be found. For example, 
cells expressing the receptor can be fractionated into the nuclei, the endoplasmic 
reticulum, vesicles, or the membrane surfaces of the cell. 

pH is preferably from about a value of 6.0 to a value of about 8.0, more preferably 
20 from about a value of about 6.8 to a value of about 7.8, and most preferably, about 
7.4. In a preferred embodiment, temperature is from about 20°C degrees to about 
50°C degrees more preferably, from about 30°C degrees to about 40°C degrees and 
even more preferably about 37°C degrees. Osmolality is preferably from about 5 
milliosmols per liter (mosmlL) to about 400 mosm/l, and more preferably, from about 
200 milliosmols per liter to about 400 mosm/l and, even more preferably from about 
290 mosm/L to about 310 mosm/L The presence of cofactors can be required for the 
proper functioning of the L66 receptor. Typical cofactors include sodium, potassium, 
calcium, magnesium, and chloride. In addition, small, non-peptide molecules, known 
as prosthetic groups may also be required. Other biological conditions needed for 
30 receptor function are well-known in the art. 

It is well-known in the art that proteins can be reconstituted in artificial membranes, 
vesicles or liposomes. (Danboldt et al.,1990). The present invention contemplates 
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that the receptor can be incorporated into artificial membranes, vesicles or 
liposomes. The reconstituted receptor can be utilized in screening assays. 

It is further contemplated that a receptor of the present invention can be coupled to a 
solid support, e.g., to agarose beads, polyacrylamide beads, polyacrylic, sepharose 
beads or other solid matrices capable of being coupled to polypeptides. Well-known 
coupling agents include cyanogen bromide (CNBr), carbonyldiimidazole, tosyl 
chloride, diaminopimelimidate, and glutaraldehyde. 

10 In a typical screening assay for identifying candidate substances, one employs the 
same recombinant expression host as the starting source for obtaining the receptor 
polypeptide, generally prepared in the form of a crude homogenate. Recombinant 
cells expressing the receptor are washed and homogenized to prepare a crude 
polypeptide homogenate in a desirable buffer such as disclosed herein. In a typical 
assay, an amount of polypeptide from the cell homogenate, is placed into a small 
volume of an appropriate assay buffer at an appropriate pH. Candidate substances, 
such as agonists and antagonists, are added to the admixture in convenient 

-^^^^ substanc e and the receptor 

polypeptide is monitored. 

20 

Where one uses an appropriate known substrate for the L66 receptor, one can, in the 
foregoing manner, obtain a baseline activity for the recombinantly produced L66 
receptor. Then, to test for inhibitors or modifiers of the receptor function, one can 
incorporate into the admixture a candidate substance whose effect on the L66 
receptor is unknown. By comparing reactions which are carried out in the presence 
or absence of the candidate substance, one can then obtain information regarding 
the effect of the candidate substance on the normal function of the receptor. 

Accordingly, this aspect of the present invention will provide those of skill in the art 
30 with methodology that allows for the identification of candidate substances having the 
ability to modify the action of nuclear receptor polypeptides in one or more manners. 

Additionally, screening assays for the testing of candidate substances are designed 
to allow the determination of structure-activity relationships of agonists or antagonists 
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with the receptors, e.g., comparisons of binding between naturally-occurring 
hormones or other substances capable of interacting with or otherwise modulating 
the receptor; or comparison of the activity caused by the binding of such molecules to 
the receptor. 

In certain aspects, the polypeptides of the invention are crystallized in order to carry 
out x-ray crystallographic studies as a means of evaluating interactions with 
candidate substances or other molecules with the nuclear receptor polypeptide. For 
instance, the purified recombinant polypeptides of the invention, when crystallized in 

10 a suitable form, are amenable to detection of intra-molecular interactions by x-ray 

.* 

crystallography. In another aspect, the structure of the polypeptides can be 
determined using nuclear magnetic resonance. 

PHARMACEUTICAL COMPOSITION: 

This invention provides a pharmaceutical composition comprising an effective 
amount of a agonist or antagonist drug identified by the method described herein and 
a pharmaceutical^ acceptable carrier. SuclTd^ 

by various routes, for example oral, subcutaneous, intramuscular, intravenous or 
20 intracerebral. The preferred route of administration would be oral at daily doses of 
about 0.01 -100 mg/kg. 

This invention provides a method of treating metabolic disorders, immunological 
indications, hormonal dysfunctions, neurosystemic diseases wherein the abnormality 
- is improved by reducing the activity of L66 receptor or blocking the binding of ligands 
to a L66 receptor, which method comprises administering an effective amount of the 
antagonist-containing pharmaceutical composition described above to suppress the 
subject's appetite. Similarly, the invention also provides methods for treating 
diseases and conditions resulting from metabolic disorders, immunological 
30 indications, hormonal dysfunctions, neurosystemic diseases, which method 

comprises administering an effective amount of an agonist-containing pharmaceutical 
composition described above. 
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TRANSFORMATION OF CELLS AND DRUG SCREENING : 

The recombinant expression constructs of the present invention are useful in 
molecular biology to transform cells which do not ordinarily express L66 to 
thereafter express this receptor. 

Such cells are useful as intermediates for making cellular preparations useful for 
receptor binding assays, which are in turn useful for drug screening. Drugs identified 
from such receptor assays can be used for the treatment of metabolic disorders, 
10 immunological indications, hormonal dysfunctions, and/or neurosystemic diseases. 

The recombinant expression constructs of the present invention are also useful in 
gene therapy. Cloned genes of the present invention, or fragments thereof, may also 
be used in gene therapy carried out by homologous recombination or site-directed 
mutagenesis. See generally Thomas & Capecchi, Cell 51, 503-512 (1987); Bertling, 
Bioscience Reports 7, 107-1 12 (1987); Smithies et aL, Nature 317, 230-234 (1985). 

tools for probing 

L66 expression in tissues. For example, tissues are probed in situ with 
20 oligonucleotide probes carrying detectable groups by conventional autoradiographic 
techniques, as explained in greater detail in the Examples below, to investigate 
native expression of this receptor or pathological conditions relating thereto. Further, 
chromosomes can be probed to investigate the presence or absence of the L66, and 
potential pathological conditions related thereto, as also illustrated by the Examples 
below. Probes according to the invention should generally be at least about 15 
nucleotides in length to prevent binding to random sequences, but, under the 
appropriate circumstances may be smaller (see above for details on hybridization). 

ANTIBODIES AGAINST THE L66 NUCLEAR RECEPTOR PROTEIN OR 
30 POLYPEPTIDE 

Another aspect of the invention includes an antibody specifically reactive with the 
protein or any part of the protein according to the invention (SEQ ID NO. 3, 24 or 19) 
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and or a polypeptide encoded by the nucleotide sequence of the nuclear receptor 
L66 (see also figures). (The term antibody" refers to intact molecules as well as 
fragments thereof, such as Fa, F(ab).sub.2, and Fv, which are capable of binding the 
epitopic determinant) By using immunogens derived from the polypeptide according 
to the invention (SEQ ID NO. 3, 24, 19) and/or encoded by the nucleic acids 
according to the invention, anti-protein/anti-peptide antisera or monoclonal antibodies 
can be made by standard protocols (E. Howell & D. Lane. Antibodies: A Laboratory 
Manual. Cold Spring Harbor Laboratory (1988)). 

10 A polyclonal antibody is prepared by immunizing a mammal, such as a mouse, a 
hamster or rabbit with an immunogenic form of the polypeptide, i.e. the human L66 
polypeptide of the present Invention, and collecting antisera from that immunized 
animal. Because of the relatively large blood volume of rabbits, a rabbit is a preferred 
choice for production of polyclonal antibodies. 

As an immunizing antigen, fusion proteins, intact polypeptides or fragments 
containing small peptides of interest can be used. They can be derived by expression 

—-—Jtaniai^ with subsequent recovering of the 

protein/peptide or peptides can be synthesized chemically (e.g. oligopeptides with 

20 10-15 residues in length). Important tools for monitoring the function of the gene 
according to the present invention, i.e. encoded by a sequence according to SEQ ID 
NO. 1 , 4, 24 or 17 (or portions thereof or splice variants thereof) are antibodies 
against various domains of the protein according to the invention. Various 
Oligopeptides from the N- and C-terminal sequences and the DBD/hinge region of 
the protein can be used as antigens. 

A given polypeptide or polynucleotide may vary in its immunogenicity. It is often 
necessary to couple the immunogen (e.g. the polypeptide) wfth a carrier. Commonly 
used carriers that are chemically coupled to peptides include bovine serum albumin 
30 (BSA) and keyhole limpet hemocyanin (KLH). The coupled peptide is then used to 
immunize the animal in the presence of an adjuvant, a non-specific stimulator of the 
immune response in order to enhance immunogenicity. The production of polyclonal 
antibodies is monitored by detection of antibody titers in plasma or serum at various 
time points following immunization. Standard ELISA or other immunoassays can be 
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used with the immunogen as antigen to assess the lev Is of antibodies. When a 
desired level of immunogenicity is obtained, the immunized animal may be bled and 
the serum isolated, stored and purified. 

To produce monoclonal antibodies, antibody-producing cells (e.g. spleen cells) from 
an immunized animal (preferably mouse or rat) are fused by standard somatic cell 
fusion procedures with immortalizing cells such as myeloma cells to yield hybridoma 
ceils. Where the immunized animal is a mouse, a preferred myeloma cell is the 
murine NS-1 myeloma cell. Such techniques are well known in the art, and include, 
10 for example, the hybridoma technique (originally developed by Kohler & Milstein. 
Nature 256: 495-497 (1975)), the human B cell hybridoma technique (Kozbar et al. 
Immunology Today 4:72 (1983)), and the EBV-hybridoma technique to produce 
human monoclonal antibodies (Cole et al Monoclonal Antibodies and Cancer 
Therapy. Alan R Liss, Inc. pp. 77-96 (1985)). 

The fused spleen/myeloma cells are cultured in a selective medium to select fused 
spleen/myeloma cells from the parental cells. Fused cells are separated from the 

the de novo synthesis of nucleotides in the tissue culture media. This culturing 
20 provides a population of hybridomas from which specific hybridomas are selected. 
Typically, selection of hybridomas is performed by culturing the cells by single-clone 
dilution in microtiter plates, followed by testing the individual clonal supematants for 
reactivity with an antigen-polypeptide. The selected clones may then be propagated 
indefinitely to provide the monoclonal antibody in convenient quantity. 

The creation of antibodies which specifically bind the polypeptide according to the 
invention (SEQ ID NO. 3, 24 or 19) and/or encoded by the nucleotide sequence of 
the nuclear receptor L66 or its complement (SEQ ID NO. 1 , 4, 17 or 22) provides an 
important utility in immunolocalization studies, and may play an important role in the 
30 diagnosis and treatment of receptor disorders. The antibodies may be employed to 
identify tissues, organs, and cells which express or the nuclear receptor L66. 
Antibodies can be used diagnostically in immuno-precipitation and immuno-blotting to 
detect and evaluate nuclear receptor L66 protein levels in tissue or from cells in 
bodily fluid as part of a clinical testing procedure. 
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Monoclonal antibodies provided by the present invention are also produced by 
recombinant genetic methods well known to those of skill in the art, and the present 
invention encompasses antibodies made by such methods that are immunologically 
reactive with an epitope of a mammalian nuclear L66 receptor protein or peptide. 

The present invention encompasses fragments of the antibody that are 
immunologically reactive with an epitope of a mammalian nuclear L66 receptor 
protein or peptide. Such fragments are produced by any number of methods, 
10 including but not limited to proteolytic cleavage, chemical synthesis or preparation of 
such fragments by means of genetic engineering technology. The present invention 
also encompasses single-chain antibodies that are immunologically reactive with an 
epitope of a mammalian nuclear L66 receptor protein or peptide made by methods 
known to those of skilled in the art. 

CHIMERIC ANTIBODIES AND OTHER TYPES OF ANTIBODIES: 

chain peptides immunologically reactive to an epitope that is a mammalian nuclear 
20 L66 receptor protein or peptide. The chimeric antibodies embodied in the present 
invention include those that are derived from naturally occurring antibodies as well as 
chimeric antibodies made by means of genetic engineering technology well known to 
those of skill in the art. 

Also included are methods for the generation of antibodies against L66 which rely on 
the use of phage display systems and related systems, such as described in 
Hoogenboom HR, de Bruine AP, Hufton SE, Hoet RM, Arends JW, Roovers RC, 
Immunotechnology 1998 Jun;4(1):1-20, and references therein. 

30 EPITOPES OF THE L66 NUCLEAR RECEPTOR 

The present invention also encompasses an epitope of a mammalian nuclear L66 
receptor protein or peptide that is comprised of sequences and/or a conformation of 
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sequences present in the mammalian nuclear L66 receptor protein or peptide 
molecule. This epitope may be naturally occurring, or may be the result of proteolytic 
cleavage of the mammalian nuclear L66 receptor protein or peptide molecule and 
isolation of an epitope-containing peptide or may be obtained by synthesis of an 
epitope-containing peptide using method of genetic engineering technology and 
synthesized by genetically engineered prokaryotic or eukaryotic cells. 

ANTISENSE OLIGONUCLEOTIDES AGAINST L66 

10 Antisense oligonucleotides are short single stranded DNA or RNA molecules which 
may be used to block the availability of the L66 receptor messenger Synthetic 
derivatives of ribonucleotides or deoxyribonucleotides and/or PNAs (see above) are 
equally possible. 

The sequence of an antisense oligonucleotide is at least partially complementary to 
the sequence (or the gene) of interest. The complementarity of the sequence is in 

any case high enough to enable the antisense oligonucleotide to bind to the nucleic 

acid according to the invention or parts thereof.. Many examples exist in whichlhir^ 
binding of oligonucleotides to the target sequence interfere with the biological 

20 function of the targeted sequence (Brysch W, Schlingensiepen KH, Design and 
application of antisense oligonucleotides in cell culture, in vivo, and as therapeutic 
agents, Cell Mol Neurobiol 1994 Oct;14(5):557-68; Wagner RW, Gene inhibition 
using antisense oligodeoxynucleotides, Nature 1994 Nov 24;372(6504):333-5 or 
Brysch W, Magal E, Louis JC, Kunst M, Klinger I, Schlingensiepen R, 
~~ Schlingensiepen KH Inhibition of p185c-erbB-2 proto-oncogene expression by 
antisense oligodeoxynucleotides down-regulates p185-associated tyrosine-kinase 
activity and strongly inhibits mammary tumor-cell proliferation, Cancer Gene Ther 
1994 Jun;1(2):99-105 or Monia BP, Johnston JF, Ecker DJ, Zounes MA, Lima WF, 
Freier SM Selective inhibition of mutant Ha-ras mRNA expression by antisense 

30 oligonucleotides, J Biol Chem 1992 Oct 5;267(28):1 9954-62 or Bertram J, Palfher K, 
Killian M, Brysch W, Schlingensiepen KH, Hiddemann W, Kneba M, Reversal of 
multiple drug resistance in vitro by phosphorothioate oligonucleotides and ribozymes, 
Anticancer Drugs 1995 Feb;6(1):124-34) 
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This interference occurs in most instances at the level of translation, i.e. through the 
inhibition of th translation^ machinery by oligonucleotides that bind to mRNA, 
however, two other mechanisms of interference with a given gene's function by 
oligonucleotides can also be envisioned, (i) the functional interference with the 
transcription of a gene through formation of a triple helix at the level of genomic DNA 
and the interference of oligonucleotides with the function of RNA molecules that are 
executing at least part of their biological function In the untranslated form 
(Kochetkova M, Shannon MF, Triplex-forming oligonucleotides and their use in the 
analysis of gene transcription. Methods Mol Biol 2000;130:189-201 Rainer B. Lanzl , 
1 0 Neil J. McKennal , Sergio A. Onatel , Urs Albrecht2, Jiemin Wongl , Sophia Y. Tsail , 
Ming-Jer Tsail, and Bert W. O'Malley A Steroid Receptor Coactivator, SRA, 
Functions as an RNA and Is Present in an SRC-1 Complex Cell, Vol. 97, 17-27, 
April, 1999). 

Antisense oligonucleotides can be conjugated to different other molecules in order to 
deliver them to the cell or tissue expressing L66. For instance the antisense 
oligonucleotide can be conjugated to a carrier protein (e.g. ferritin) in order to direct 
the oligonucleotide towards the desired target tissue, i.e. in case of ferritin 



predominantly to the liver. 



20 



Antisense expression constructs are expression vector systems that allow the 
expression - either inducible or uninducible - of a complementary sequence to the 
L66 sequences according to the invention. The potential possibility of such an 
approach has been demonstrated in many different model systems (von Ruden T, 
Gilboa E, Inhibition of human T-cell leukemia virus type I replication in primary 
- human T cells that express antisense RNA, J Virol 1989 Feb;63(2):677-82; Nemir M, 
Bhattacharyya D, Li X, Singh K, Mukherjee AB, Mukherjee BB, Targeted inhibition of 
osteopontin expression in the mammary gland causes abnormal morphogenesis and 
lactation deficiency, J Biol Chem 2000 Jan 14;275(2):969-76; Ma L, Gauville C, 
30 Berthois Y, Millot G, Johnson GR, Calvo F.Antisense expression for amphiregulin 
suppresses tumorigenicity of a transformed human breast epithelial cell line, 
Oncogene 1999 Nov 11;18(47):6513-20; Refolo LM, Eckman C, Prada CM, Yager D, 
Sambamurti K, Mehta N, Hardy J, Younkin SG, Antisense-induced reduction of 
presenilin 1 expression selectively increases the production of amyloid beta42 in 
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transfected cells, J Neurochem 1999 Dec;73(6):2383-8; Buckley NJ, Abogadie FC, 
Brown DA, Dayrell M, Caulfield MP, Delmas P, Haley JE, Use of antisense 
expression plasmids to attenuate G-protein expression in primary neurons, Methods 
Enzymol 2000;314:136^8). 



According to the invention an antisense expression construct can be constructed with 
virtually any expression vector capable of fulfilling at least the basic requirements 
known to those skilled in the art. 

10 In one embodiment of the invention retroviral expression systems or tissue specific 
gene expression systems are preferred. 



Current standard technologies for delivering antisense constructs are performed 
through a conjugation of constructs with liposomes and related, complex-forming 
compounds, which are transferred via electroporation techniques or via particle- 
mediated "gene gun" technologies into the cell. Other techniques may be envisioned 
by one skilled in the art 



Microinjection still plays a major role in most gene transfer techniques for the 
20 generation of germ-line mutants expressing foreign DNA (including antisense RNA 
constructs) and is preferred embodiment of the present invention. 



RIBOZYMES DIRECTED AGAINST L66 

~~ Ribozymes are either RNA molecules (Gibson SA, Pellenz C, Hutchison RE, Davey 
FR, Shiliitoe EJ, Induction of apoptosis in oral cancer cells by an anti-bcl-2 ribozyme 
delivered by an adenovirus vector, Clin Cancer Res 2000 Jan;6(l):213-22; Folini M, 
Colella G, Villa R, Lualdi S, Daidone MG, Zaffanoni N, Inhibition of Telomerase 
Activity by a Hammerhead Ribozyme Targeting the RNA Component of Telomerase 
30 in Human Melanoma Cells, J Invest Dermatol 2000 Feb;1 14(2):259-267; Halatsch 
ME, Schmidt U, Botefur IC, Holland JF, Ohnuma T, Marked inhibition of glioblastoma 
target cell tumorigenicity in vitro by retrovirus-mediated transfer of a hairpin ribozyme 
against deletion-mutant epidermal growth factor receptor messenger RNA, J 
Neurosung 2000 Feb;92(2):297-305; Ohmichi T, Kool ET, The virtues of self-binding: 
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high sequence specificity for RNA cleavage by self-processed hammerhead 
ribozymes, Nucleic Acids Res 2000 Feb 1;28(3):776-783) or DNA molecules (Li J, 
Zheng W, Kwon AH, Lu Y, In vitro selection and characterization of a highly efficient 
Zn(ll)-dependent RNA-cleaving deoxyribozyme; Nucleic Acids Res 2000 Jan 
15;28(2):481-488) that have catalytic activity. The catalytic activity located in one part 
of the RNA (or DNA) molecule can be "targeted" to a specific sequence of interest by 
fusing the enzymatically active RNA molecule sequence with a short stretch of RNA 
(or DNA) sequence that is complementary to the L66 transcript Such a construct will, 
when introduced into a cell either physically or via gene transfer of a ribozyme 
1 0 expression construct find the L66 sequence (our sequence of interest) and bind via 
its sequence-specific part to said sequence. The catalytic activity attached to the 
construct, usually associated with a special nucleic acid structure (people distinguish 
so called "hammerhead" structures and "hairpin" structures), will then cleave the 
targeted RNA. The targeted mRNA will be destroyed and cannot be translated 
efficiently, thus the protein encoded by the mRNA derived from L66 will not be 
expressed or at least will be expressed at significantly reduced amounts. 

In a preferred embodiment the invention covers inducible ribozyme constructs 

20 respond to the second messengers cGMP and cAMP, Nat Struct Biol 1999 
Nov;6(11):1062-1071). 

In a further preferred embodiment the invention concerns the use of "bivalent" 
ribozymes (multimers of catalytically active nucleic acids) as described in (Hamada 
M, Kuwabara T, Warashina M, Nakayama A, Taira K, Specificity of novel 
_ allostericaliy trans- and cis-activated connected maxizymes that are designed to 
suppress BCR-ABL expression FEBS Lett 1999 Nov 12;461(1-2):77-85). 

TRANSGENIC ANIMALS CARRYING THE L66 NUCLEAR RECEPTOR 

30 

Also provided by the present invention are non-human transgenic animals grown 
from germ cells transformed with the L66 nucleic acid sequence according to the 
invention and that express the L66 receptor according to the invention and offspring 
and descendants thereof. Also provided are transgenic non-human mammals 
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comprising a homologous recombination knockout of the native L66 receptor, as well 
as transgenic non-human mammals grown from germ cells transformed with nucleic 
acid antisense to the L66 nucleic acid of the invention and offspring and descendants 
thereof. Further included as part of the present invention are transgenic animals 
which the native L66 receptor has been replaced with the human homolog. Of 
course, offspring and descendants of all of the foregoing transgenic animals are also 
encompassed by the invention. 

Transgenic animals according to the invention can be made using well known 
10 techniques with the nucleic acids disclosed herein. E.g. f Leder et al., U.S. Patent 
Nos.4,736,866 and 5,175,383; Hogan et al., Manipulating the Mouse Embryo, A 
Laboratory Manual (Cold Spring Harbor Laboratory (1986)); Capecchi, Science 244, 
1288 (1989); Zimmer and Gruss, Nature 338, 150 (1989); Kuhn et al., Science 269, 
1427 (1995); Katsuki et al., Science 241, 593 (1988); Hasty et al., Nature 350, 243 
(1991); Stacey et al., Mol. Cell Biol. 14, 1009 (1994); Hanks et al., Science 269, 679 
(1995); and Marx, Science 269, 636 (1995). Such transgenic animals are useful for 
screening for and determining the physiological effects of L66 receptor agonists and 
antagonist. 



20 Consequently, such transgenic animals are useful for developing drugs to regulate 
physiological activities in which L66 participates. 

The following examples are provided for illustrative purposes only and are not 
intended, nor should they be construed, as limiting the invention in any manner. 

MODELLING OF THE STRUCTURE OF L66 

The novel nuclear receptor sequences disclosed herein may be used for various in 
silico, i.e. computer analyses. Such analyses may be for example nuclear receptor 
30 specific sequence alignments which permit the identification of domains and even 
new receptors. The novel domain sequences disclosed herein may be used in order 
to create domain specific hidden markov models (hmms) or simply as search 
sequences. 
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In a preferred embodiment this similarity search tool is the BLAST algorithm^ 
Altschul, Stephen F., Warren Gish, Webb Miller, Eugene W. Myers, and David J. 
Lipman (1990). Basic local alignment search tool. J. Mol. Biol. 215:403-10 and the 
sequence used is one of those disclosed herein. 

Another search tool that may be used is FASTA (W. R, Pearson and D. J. Lipman 
(1988), "Improved Tools for Biological Sequence Analysis", PNAS 85:2444- 2448, 
and W. R. Pearson (1990) "Rapid and Sensitive Sequence Comparison with FASTP 
10 and FASTA" Methods in Enzymology 183:63- 98). 

The invention is not limited to one particular type of search tool. In one embodiment 
of the invention search tools are used that do not search by sequence similarity but 
by applying sequence profiles such as a profile generated when applying the Profile 
Hidden Markov Model. 

Profile Hidden Markov Models also called „Hidden Markov Models", here abbreviated 

structure 

of a sequence family. The profiles use scores specific of the position of amino acids 
20 (or nucleotides) and position specific scores for the opening or the expansion of an 
insertion or deletion. Methods for the creation of profiles, starting from multiple 
alignments, have been introduced by Taylor (1986), Gribskov et al. (1987), Barton 
(1990) and Heinikoff (1996). 

HMMs provide an utterly probabilistic description of profiles, i.e. Bayes 1 theory rules 
the positioning of all probability (evaluation) parameters (compare Krogh et al. 1994, 
Eddy 1996 und Eddy 1998). The central idea behind this is that a HMM is a finite 
model describing the probability distribution of an infinite number of possible 
sequences. The HMM consists of a number of states corresponding with the columns 
30 of a multiple alignment as it is usually depicted. Each state emits symbols 
(remainders) corresponding with the probability of the symbol emission (specific of 
the respective state), and the states are linked with each other by probabilities of the 
changing of states. Starting from one specific state, a succession of states is 
generated by changing from one state to the other in accordance with the probability 
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of the changing of states, until a final state has been reached. Each state then omits 
symbols according to the probability distribution of emissions specific of this state, 
creating an observable sequence of symbols. 



The attribute Jiidden" has been derived from the fact that the underlying sequence of 
states cannot be observed. Only the sequence of symbols is visible. An assessment 
of the probabilities of changing of states and of emissions (the training of the model) 
is achieved by dynamic programming algorithms implemented in the HMMER 
package. 

10 

The sequences according to the invention may be aligned with other nuclear receptor 
sequences in order to create a multiple sequence alignment which is used as the 
basis for the creation of a HMM. 

If an existing HMM and a sequence are given, the probability that the HMM could 
generate the sequence in question, can be calculated. The HMMER package 
provides a numerical quantity (the Score) in proportion to this probability, i.e. the 
information content of the sequence indicated as bits, measured according to the 
HMM. ™ . . ~~ 

20 

See also Barton, G.J. (1990); Protein multiple alignment and flexible pattern 
matching, Methods Ezymol. 183: 403-427, Eddy, S.R. (1996): Hidden markov 
models. Curr. Opin. Strct. Biol. 6: 361-365, Eddy, S.R, (1998): Profile hidden markov 
models.Bioinformatics. 14: 755-763, Gribskov, M. McLachlan, A.D. und Eisenberg D. 
(1987): Profile analysis: Detection of distantly related proteins. Proc. Natl. Acad. Sci. 
- USA. 84: 4355-5358, Heinikoff, S. (1996): Scores for sequence searches and 
alignment, Curr. Opin. Strct. Biol. 6: 353-360, Krogh, A., Brown, M., Mian, I.S., 
Sjolander, K. und Haussler, D. (1994): Hidden markov models in computational 
biology; Applications to protein modelling, J. Mol. Biol. 235: 1501-1531, Taylor, W.R. 
30 (1986): Identification of protein sequence homology by consensus template 
alignment J. Mol. Biol. 188: 233-258. 
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In general the sequence are selected such that a query using a search sequence 
returns a result consisting of sequences which are at least to a certain degree similar 
to the query sequence. 

In one embodiment of the invention amino acid sequences of the present invention 
are used to model the three-dimensional structure of the protein. Initially, this 
involves the comparison of the protein sequence with the sequence of related 
proteins where the structure is known, such as the human PPARy ligand-binding 
domain (Nolte RT, Wisely GB, Westin S, Cobb JE, Lambert MH, Kurokawa R, 
10 Rosenfeld MG, Willson TM^ Glass CK, Milburn MV,Nature 1998 Sep 

10;395(6698):1 37-43). The three-dimensionale structure can then be modelled using 
computer programs. From the three-dimensional structure, binding sites of potential 
inhibitors or activators can be predicted. It can further be predicted which kinds of 
molecule might bind there. The predicted substances can then be screened to test 
their effect on nuclear receptor activity. 

EXAMPLES 



EXAMPLE 1 : CLONING AND EXPRESSION OF THE GENE ACCORDING TO THE 
20 INVENTION 

Construction of suitable vectors containing the desired coding and control sequences 
employs standard ligation and restriction techniques that are well understood in the 
art. Isolated plasmids, DNA sequences, were synthesized oligonucleotides were 
cleaved, tailored, and religated in the form desired. 

Site-specific DNA cleavage was performed by treatment with the suitable restriction 
enzyme (or enzymes) under conditions that are generally understood in the art, and 
the particulars of which are specified by the manufacturer of these commercially 
30 available restriction nzymes. 

See, e.g., New England Biolabs, Product Catalog. In general, about 1 pg of plasmid 
and/or DNA sequence was cleaved by one unit of enzyme in about 20 pi of buffer 
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solution. Often excess of restriction enzyme was used to ensure complete dig stion 
of the DNA substrat . Incubation times of about one hour to two hours at about 37°C 
are workable, although variations are tolerable. 

After each incubation, protein was removed by extraction with phenol/chloroform, and 
may be followed by ether extraction. The nucleic acid was recovered from aqueous 
fractions by precipitation with ethanol. If desired, size separation of the cleaved 
fragments was performed by polyacryiamide gel or agarose gel electrophoresis using 
standard techniques. A general description of size separations is found in Methods in 
1 0 Enzymology 65, 499-560 (1 980). 



Transformed host cells are cells which have been transformed or transfected with 
recombinant expression constructs made using recombinant DNA techniques and 
comprising mammalian nuclear receptor L66 encoding sequences. Preferred host 
cells for transient transfection are COS-7 cells. Transformed host cells may ordinarily 
express nuclear receptor L66, but host cells transformed for purposes of cloning or 
amplifying nucleic acid hybridization probe DNA need not express the nuclear L66 
receptor. When expressed, the mammalian nuclear L66 receptor protein was 
—^yplesi^loeated ^ 

Cultures of cells derived from multicellular organisms are desirable hosts for 
recombinant nuclear receptor L66 protein synthesis. In principal, any higher 
eukaryotic cell culture is workable, whether from vertebrate or invertebrate culture. 
However, mammalian cells are preferred. Propagation of such cells in ceil culture has 
become a routine procedure. See Tissue Culture (Academic Press, Kruse & 
Patterson, Eds., 1973). Examples of useful host cell lines are bacteria cells, insect 
cells, yeast cells, human 293 cells, VERO and HeLa cells, LMTK- cells, and WI138, 
BHK, COS-7, CV, and MDCK cell lines. Human 293 cells are preferred. 



30 EXAMPLE 2: NUCLEAR RECEPTOR L66 TISSUE LOCALIZATION: 



A multiple tissue northern blot (Clontech, Palo Alto) was hybridized to a labeled 
probe. The blot contained about 03 to 3 M9 of poly A RNA derived from various 
tissues. Hybridization was carried out in a hybridization solution such as one 
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containing SSC (see Maniatis et al, ibid) at an optimized temperature between 50°c 
and 70°C, preferably 65°C. The filter was washed and a film xposed for signal 
detection (see also: Maniatis et al., Molecular Cloning: A laboratory Manual, Cold 
Spring Harbor Laboratory Press, N.Y.(1989)). 



EXAMPLE 3: NUCLEAR RECEPTOR L66 cDNA ISOLATION FROM HUMAN AND 
OTHER ORGANISMS: 

10 A cloning strategy was used to clone the L66 receptor cDNA from specific cDNA 
libraries (Clontech, Palo Alto) or alternatively, RNA was obtained from various tissues 
and used to prepare cDNA expression libraries by using for example an Invitrogen 
kit. (Invitrogen Corporation, San Diego). For the isolation of the L66 cDNA clone the 
chosen library was screened under stringent condition (see definitions above) by 
using an L66 specific probe. The cDNA insert of positive clones was subsequently 
sequenced and cloned in a suitable expression vector. 

Additionally, full length receptor L66 clones from human and mouse (Mus musculus) 
was obtained by using RACE PCR technology. In brief, suitable cDNA libraries were 
20 constructed or purchased. Following reverse transcription, the first strand cDNA was 
used directly in RACE PCR reactions using a RACE cDNA amplification kit according 
to the manufactures protocol (Clontech, Palo Alto). Amplified fragments were 
purified, cloned and subsequently used for sequence analysis. 

"To obtain information about the genomic organization (see also figures) of the 
receptor gene, genomic libraries were screened (commercially available clone 
libraries were used) with a receptor specific probe under stringent conditions. 
Positive clones were isolated and the complete DNA sequence of the putative 
receptor region was determined by sequence analysis (Maniatis et al., Molecular 
30 Cloning: A laboratory Manual, Cold Spring Harbor Laboratory Press, N.Y.(1989)) 
(human genomic clone deposited at "Deutsche sammlung von Mikroorganismen und 
Zellkulturen GmbH (DSMZ, International Depositary Authority under the Budapest 
Treaty in Germany) under number DSM 14483). 
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EXAMPLE 4: NUCLEAR RECEPTOR L66 LIGAND BINDING ASSAY: 

Saturation ligand binding analysis and ligand competition studies are carried out by 
using an over expressed L66 receptor-protein which is incubated with a labeled 
potential ligand at different concentrations. The L66 receptor may be immobilized on 
suitable surfaces. Various competitors are added in the presence of a labeled ligand. 
Bound and unbound ligand is separated be using e.g. gel filtration or charcoal based 
methods or simply removed by a washing sfep if the receptor protein L66 is 
10 immobilized. Receptor-bound ligand is detected by scintilation counting. 

EXAMPLE 5: NUCLEAR RECEPTOR L66 EXPRESSION ASSAY: 

Relative quantification was performed in multiplex PCR reactions using the ABI 
Prism® 7700 Sequence Detection System. Total RNA was used as template and was 
either ordered from Clontech or Ambion in the case of human normal tissue or was 
isolated from celNine^^ 
Microorganisms and Cell Cultures). 

20 Relative quantification was achieved by normalising the results for the presence of 
18S rRNA in the samples and subsequently by standardising the corresponding 
expression data to the expression levels detected in testis (testis = 1). A standard 
curve was generated by diluting total RNA from testis tissue 1:10 fold, starting from 
- 100 ng input amount and performing 7 dilution steps. For the detection of target 
mRNA in other samples the input amount was always 100 ng total RNA, samples 
were measured in triplicates. 

For the detection of 18S rRNA the endogenous control pre-developed assay reagent 
"Ribosomal RNA control (18S rRNA)" from Applied biosystems was used. For the 
30 detection of the target sequence, PCR primers and the probe were designed 

according to Applied biosystem's specifications using its Primer Express® software. 
PCR primers were ordered from Interactiva (forward primer (SEQ ID NO. 14): CGT 
GGG CTA ATG AAT TTT ACC AAG; reverse primer (SEQ ID NO. 15): GGC CCC 
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ATG GAG AAA TAT CAC T). The probe (SEQ ID NO. 16) CCA ATG AGG ATC AAA 
CTG CAC TAC AGA AGG G was ordered from Applied Biosystems and was labelled 
with FAM at the 5'end and contained a quencher (TAMRA) at the 3' end. 

EXAMPLE 6: FORMATION OF L66-RXR OR OTHER PROTEIN COMPLEXES 

In order to explore the functional properties of L66, the DNA binding properties of L66 
are analyzed. It has previously been shown that RXR is a common heterodimeric 
partner of various members of the nuclear receptor superfamiiy required for high 
10 affinity DNA binding (Hallenbeck et aL, PNAS 89: 5572-5576 (1989)), Kliewer et ah, 
EMBO J. 11: 1419-1435 (1992)). It has also been shown that DNA and ligand binding 
activities of the Dros. melanogaster ecdyson receptor (EcR) require heterodimer 
formation with RXR or USP (the homologue of RXR) (O'Malley in Endocrinology 125: 
1119-1120(1989)). 

Consequently, it is of interest whether L66 can interact with RXR, or with other 
members of the family. 



A two hybrid system is used. CV-1 cells are transiently transfected with 
cytomegalovirus promoter driven expression vectors containing the yeast GAL4 DNA 
binding domain (DBD) alone, GAL4 linked to L66 LBD (LBD; i.e. GAL4-L66) and the 
78 amino acid Herpes virus VP16 transactivation domain (VP) linked to the amon 
terminal end of the LBDs for human (or mouse) RXRa (VP-RXR), mouse 
PPARgamma (VP-PPAR) VDR (VP-VDR) and others. 

All cells are cotransfected with a luciferase reporter construct containing about 4 
copies of the yeast GAL4 upstream activating sequence and a p-galactosidase 
expression vector as internal control. 

30 CV-1 cells are grown in DMEM supplemented with 10% AG1-X8 resin-charcoal 
stripped calf bovine serum, 50 U/ml penicillin G and 50 |jg/ml streptomycin sulfate 
(DMEM-CBS) at 37°C in 5% C0 2 . One day prior to transfection, cells are plated to 
50-80% confluence using a phenol-red free DMEM with 10% resin charcoal stripped 



20 
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fetal bovine serum (DMEM-FBS). Cells are transfected (with reporter construct (300 
ng/10 5 cells), cytomegalovirus driv n receptor (100 ng/10 5 cells) and p-galactosidase 
expression vectors (500 ng/10 5 cells) by llpofection using N-{2-(2,3)- 
dfofeoyloxyJpropyf-N^.N-trimethyl ammonium methyl sulfate} according to the 
manufacturer's instructions (DOTAP, Boehringer Mannheim). 

After 2 hours the liposomes are removed and cells treated for 40 hours with phenol- 
red free DMEM-FBS containing famesol as the iigand. Cells are harvested and 
assayed for luciferase and p-galactosidase activity. 

10 

All points are preferentially performed in triplicate and should ideally vary less than 
10%. Experiments are repeated a 2-4 times. Data points are normalized for 
differences in transfection efficiency using p-galactosidase, and plotted as relative 
activity where the untreated reporter is defined to have an activity of 1 unit. 

Neither the GAL4 DBD, nor the GAL4-L66 chimera should be capable of stimulating- 
transcription from a reporter construct containing the GAL4 upstream activating 

transactivation domain linked to the RXRa-LBD (VP-RXR) should be inactive when 
20 expressed alone or with the GAL4 DBD. However, when GAL4-L66 and VP-RXR are 
coexpressed, the reporter could be activated, indicating that L66 and RXRa interact 
efficiently in cells. Using similar VP16-LBD fusion proteins, interaction can be tested 
between L66 and receptors for peroxisome proliferators/fatty acids (PPAR), vitamin 
D3 (VDR), thyroid hormone, (T 3 R), retinoic acid (RAR), or other members of the 
_ nuclear receptor superfamily (see also US Pat. 6,005,086). 



FIGURE CAPTIONS: 
FJG.1: 

30 Fig. 1 shows the cDNA sequence of the L66 gene according to the invention. It also 
shows the reverse complement thereof. 



FIG. 2: 

Fig. 1 shows the protein sequence of the L66 gene according to the 



FIG. 3: 

Fig. 3 shows the domain composition of FXR-p (L66) comprising a so called C 
binding domain "DBD" and a ligand binding domain "LBD" which are present ii 
members of the nuclear receptor superfamily. Also shown are the respective 
probability values for the presence of these domains which as can be seen in 
figure are extremely high. 

10 

FIG. 4: 



Fig. 4 shows the cDNA sequence from L66 from Mus musculus (SEQ ID NO. 
well as its reverse complement (SEQ ID NO. 18) and the protein sequence of 
from Mus musculus (SEQ ID NO. 1 9). 

■ FIG. 5: . . — — — ■ — — . .~— . — — — — 



Fig. 5 shows the DNA sequence (NC Fragment) of a splice variant of L66 (SEQ 
NO. 4). It also shows the reverse complement of the sequence (SEQ ID NO. 5). 

20 

FIG. 6: 



" Fig. 6 shows the DNA sequence (PolyAI Fragment, C-terminal end) of a splice 
variant of L66. It also shows the reverse complement of the sequence. This differs 
from the NC Fragment in so far as the poly-A-tail follows the exons 7 and 7B directly 
(SEQ ID NO. 7). 



FIG. 7: 



30 Fig. 7 shows the DNA sequence (PolyA2 Fragment, C-terminal end) of a splice 
variant of L66. It also shows the reverse complement of the sequence. This differs 
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(SEQ ID NO. 9). 

FIG. 8: 

Fig. 8 shows the DNA sequence (GF Fragment, C-terrninal end) of a splice variant of 
L66. It also shows the reverse complement of the sequence. This differs from the NC 
Fragment in so far as exon.8 is not present (SEQ ID NO. 11). 

10 FIG. 9: 

Fig. 9 shows the DNA sequence (SF Fragment, Oterminal end) of a splice variant of 
L66. It also shows the reverse complement of the sequence. This differs from the NC 
Fragment in so far as exon 7 and 8 are not present (SEQ ID NO. 13). 

FIG. 10: 



Fig. 10 is a schematic representation of the exon distribution within the splice 
variants found. 

20 

FIG. 11: 

Fig. 11 shows the result of a expression test on various cell lines and tissues. 
FIG. 12: 

Fig. 12 shows the genomic DNA sequence for L66 from Mus musculus (SEQ ID NO. 
20). 



PCT/EP01/10323 

'-51--- 

F1G. 13: 

Fig. 13 shows the reverse complement of the genomic DNA sequence for L66 from 
Mus musculus (SEQ ID NO. 21). 

FIG. 14: 

Fig. 14 shows a splice variant cDNA sequence of L66 from Mus musculus (SEQ ID 
NO. 22), its reverse complement (SEQ ID NO. 23) and the corresponding protein 
10 sequence (SEQ ID NO. 24). 

FIG. 15: 

Fig. 15 lists all the sequences according to the invention, their respective origin as 
well as in which figures they are depicted. 

Fig. 16 shows a BLASTP (improving the accuracy of PSI-BLAST protein database 
20 searches with composition-based statistics and other refinements; Nucleic Acids 
Res. 2001 Jul 15;29(14):2994~3005.) - alignment of L66 against tremblnew 
(sequence identifier |AF384555|AF384555) "NR1 H4"; product: "famesol receptor 
from Homo sapiens famesol receptor (NR1H4) mRNA, complete cds, alternatively 
spliced. //:gp|AF384555|14326451 gene: n NR1H4 w ; product: "famesol receptor"; 
Homo sapiens famesol receptor (NR1H4) mRNA, complete cds, alternatively spliced. 
The hit is scoring at : 7e-88 (expectation value) for an alignment length (overlap) : 
389 having 46 % identities (Scoring matrix : BLOSUM62 (used to infer consensus 
pattern). 
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Fig. 17 A shows the intron - exon composition of the mouse L66 gene (numericals). 

Fig. 17 B shows the intron - exon composition of spliceform variant 1 of the mouse 
L66 gene (numericals). 



FIG. 18: 

Fig. 1 8 A shows the intron - exon composition of the mouse L66 gene (sequences). 

Fig. 18 B shows the intron - exon composition of spliceform variant 1 of the mouse 
L66 gene (sequences). 

FIG. 19: 

Fig. 1 9 shows a multiple sequence alignment of the L66 C4 zinc finger domain and 
shows the conservation of this domain. 



FIG. 20: 



Fig. 20 19 shows a multiple sequence alignment of the L66 protein and demonstrates 
the conservation of the L66 Ligand Binding Domain (LBD). 
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CLAIMS: 

1 . A nucleic acid molecule coding for a nuclear receptor which is selected from the 
group comprising: 

a) the nucleotide sequences set forth in SEQ ID NOs: 1, 4, 6, 8, 10, 12, 17, 20 
and/or 22; 

b) or complement thereof as set forth in SEQ ID NOs: 2, 5, 7, 9, 1 1 , 1 3, 18, 21 
10 and/or 23; 

c) a nucleic acid which hybridizes to a nucleic acid having a nucleotide sequence 
which is the complement of the nucleotide sequence of SEQ ID NOs: 1, 4, 6, 
8, 10, 12, 17, 20 and/or 22 under conditions of high stringency, and 

d) a nucleic acid which hybridizes to a nucleic acid having a nucleotide sequence 
which is the complement of the nucleotide sequence of SEQ ID NOs: 2, 5, 7, 
9, 11, 13, 18, 21 and/or 23 under conditions of high stringency. 



20 2. The nucleic acid molecule of claim 1 which is genomic DNA. 

3. The nucleic acid molecule of claim 1 which is cDNA. 

4. The nucleic acid molecule of claim 1 which is RNA. 

~5. A nucleic acid molecule comprising the. nucleic acid molecule of any of claims 1 to 
4 and a label attached thereto. 

6. A vector comprising the nucleic acid molecule of claim 1 . 

30 

7. The vector of claim 6, which is an expression vector. 

8. A host cell transfected with the vector of claim 6 or 7. 
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9. A host cell transfected with the expression vector of claim 7. 

10. A method of producing a polypeptide comprising the step of culturing the host cell 
of claim 9 in an appropriate culture medium to, thereby, produce the polypeptide. 

1 1 .An isolated polypeptide encoded by any portion of the nucleic acid of claim 1 . 

12. A polypeptide selected from the group comprising: 

10 the amino acid sequences set forth in SEQ ID NO: 3, 24 and/or 19. 

1 3. A method for screening for agents which are capable of inhibiting the 

cellular function of the nuclear receptor L66 comprising the steps of: 

a) contacting one or more candidate agents with a polypeptide according to 
claim 11 or 12 

b) removing unbound agent(s) 

receptor. 

20 

14. A method for inhibiting the cellular function of the nuclear receptor L66, 
comprising the steps of. 

a) contacting a cell with a binding agent of a polypeptide according to claim 
11 or 12, 

whereby the cellular function of L66 is inhibited. 

15. Method according to claim 14, 

characterized in that the binding agent is an antibody. 

30 

16. Method according to claim 14, 

characterized in that the binding agent is RNA. 



17. Method according to claim 14, 



/ 
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18. Method according to claim 14, 

characterized in that the binding agent is a ribozyme. 

19. Method according to claim 14, 

characterized in that the cell is in a body. 

20. Use of a nucleic acid or protein sequence according to SEQ ID NO.: 1, 4, 6, 8, 10, 
12, 17, 20, 22, 2, 5, 7, 9, 11, 13, 18, 21, 3, 19, 24 and/or 23 for the construction of 
multiple nuclear receptor specific sequence alignments. 



21. Use of the sequences according to claim 20 for the construction of protein 
sequence alignments. 
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Fig. 1 



SEQ ID NO. 1.: (cDNA Sequence Homo sapiens L66) 
5 1 - 

cctggaataa aaaggtccag accaacctat tcttcctcga gaaataaggg acaggaagaa 60 

ttctgtgtag tttgtggtga taaagcatca ccatcaccat atcattataa tgcacttacc 120 

tgtgaaggtt gcaaagaaat acctatggta aaaaatttta aaactttttt attgggtttt 180 

tttcaatgta gcatcimnca aaatgcagta tatagttgca ggaatggtag tcactgtgaa 240 

atggacatgt acatgcgtag aaaatgtcaa gagtgcagac tgaaaaagta taaggcagta 3 00 

ggaatgttgg cagaatgttt gctcacagaa atccaatgta aattaaagag acttcaaaag 360 

aactttaagg agaagaatca tttttactct aacatcaaag tggaagagga aggagtagac 420 

cacagttttc tatcatccac cactagacct ggaaaagaaa gcatggaact aactgaagag 480 

gaacatcagc tcattaataa cattgtggct gctcatcaaa aatataccat tcctttagaa 540 

gaaacaaatt tgtatctgca ggaacataca aatcctgaao tgagcttttt gcaactctca 600 

gagacagcag tcctacacat acgtgggcta atgaatttta ccaaggggct cccaggattt 660 

gaaaatttgg ccaatgagga tcaaactgca ctacagaagg gatcaaaaac tgaagtgata 720 

tttctccatg gggcccaact ttacaataca atgataattt ccatatgttt gattctaccc 780 

tatgtttgga tgaaaataca ttttcgtatc agttttttgg gtgttactga agaatttatt 840 

acannnctgt tttacttcta caaaagaatg agcaaacttg atgtaactaa tactgaatat 900 

gctctgcttg cagcaacaat tgttttttca gatcgtccat gccttaaaaa taagcaatat 960 

atggaaaatt tannngaacc agttttacaa atattgtata agtattcaaa aatgtatcat 1020 

ccagaagacc cannricafctt tgcccatctc atatggaagc atactgaact gagaactctg 1080 

aattataacc attcagaaat acttagcact tggaaaacaa aggaccccaa attggctact 1140 

ttactctctg ag - 3* 1152 



SEQ ID NO. 2: (cDNA Reverse Complement L66) 

5*- 

ctcagagagt aaagtagcca atttggggtc ctttgttttc caagtgctaa gtatttctga 60 

atggttataa ttcagagttc tcagttcagt atgcttccat atgagatggg caaaatgnnn 120 

tgggtcttct ggatgataca tttttgaata cttatacaat atttgtaaaa ctggttcnim 180 

taaattttcc atatattgct tatttttaag gcatggacga tctgaaaaaa caattgttgc 240 

tgcaagcaga gcatattcag tattagttac atcaagtttg ctcattcttt tgtagaagta 300 

aaacagnnnt gtaataaatt cttcagtaac acccaaaaaa ctgatacgaa aatgtatttt 360 

catccaaaca tagggtagaa tcaaacatat ggaaattatc attgtattgt aaagttgggc 420 

cccatggaga aatatcactt cagtttttga tcccttctgt agtgcagttt gatcctcatt 480 

- ggccaaattt tcaaatcctg ggagcccctt ggtaaaattc attagcccac gtatgtgtag 540 

gactgctgtc tctgagagtt gcaaaaagct cagttcagga tttgtatgtt cctgcagata 600 

caaatttgtt tcttctaaag gaatggtata tttttgatga gcagccacaa tgttattaat 660 

gagctgatgt tcctcttcag ttagttccat gctttctttt ccaggtctag tggtggatga 720 

tagaaaactg tggtctactc cttcctcttc cactttgatg ttagagtaaa aatgattctt 780 

ctccttaaag ttcttttgaa gtctctttaa tttacattgg atttctgtga gcaaacattc 840 

tgccaacatt cctactgcct tatacttttt cagtctgcac tcttgacatt ttctacgcat 900 

gtacatgtcc atttcacagt gactaccatt cctgcaacta tatactgcat tttgimngat 960 

gctacattga aaaaaaccca ataaaaaagt tttaaaattt tttaccatag gtatttcttt 1020 

gcaaccttca caggtaagtg cattataatg atatggtgat ggtgatgctt tatcaccaca 1080 

aactacacag aattcttcct gtcccttatt tctcgaggaa gaataggttg gtctggacct 1140 

ttttattcca gg - 3* 1152 
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Fig. 2 

5EQ ID NO. 3: (Protein L66 Homo Sapiens) 

PGIXRSRPTY SSSRNKGQEB FCWCGDKAS PSPYHYNALT CEGCKBIPMV KNFKTFLLGF 60 

FQCSIXQNAV YSCRNGSHCK MDMYMRRKCQ ECRLKKYKAV GMLAECLLTB IQCKLKRLQK 120 

NFKEKNHFYS NIKVEEEGVD HSFLSSTTRP GKESMELTEE EHQLINNIVA AHQKYTIPLE 180 

ETNIiYLQEHT NPELSFLQLS ETAVLHIRGL MNFTKGLPGF KNLANEDQTA LQKGSKTEVI 240 

FLHGAQLYNT MIISICLILP YVWMKIHFRI SFLGVTEEFI TXLFYFYKRM SKLDVTNTEY 300 

ALLAATIVFS DRPCLKNKQY MENLXEPVLQ ILYKYSKMYH PEDPXHFAHL IWKHTELRTL 360 

NYNHSEIXiST WKTKDPKIiAT LLSE 384 
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Fig. 3 



Domain composition: 

Domain Region from protein 



E- value 



ZnF_C4 
HOIiI 



19-105 
201-360 



14.8e-15 (A below) 
9.99e-13 (B below) 



B 




HOLI Domain 
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cDNA L66 from Mus musculus (SEQ ID NO. 17) 



gagaaafcggg atctgcactc attttcctga 



at caac t art cr 


tgataeagat 


60 


fctcctccagc 


gttacagtcc 


120 


oacaatacacr 


t crcracrcr c acrt 


180 




t cacgat gat 


240 






300 


caacra t at c a 


ctacaatgca 


360 




tacacrfccrfcafc 


420 


t* crncicacraaa 


atgccaagag 


480 


aafccrfcttCTCt 


cacagagat c 


540 


rmr»r | t' (TCCCi" 
MM v> t» y v* v*. v* 


ai* aeccfccrec 

y LrCLV \^%p~ 


600 


catccagcac 


cagat ct ggg 


ObU 


atcggcttct 


gaataccata 


720 


caagcaaact 


tctgcaggag 


780 


tatcagtcct 


gcacatacaa 


840 


atttaaccac 


tgaggatcag 


900 


ttcatgtagc 


ccagctttat 


960 


caaagccctc 


agctgggaca 


1020 


ctccggaaaa 


ctttctcaag 


1080 


aatttattgc 


ctcactatct 


1140 


ctgaatatgc 


tctgcttacg 


1200 


ataagcagca 


tatagaaaac 


1260 


aaatgtacca 


tccagaagac 


1320 


tgagaactct 


gagtcacagc 


1380 


ggttggtgat 


gttattctct 


1440 






1470 



tcaggaaaat gagtgcagat cccatttctc agagaataac atcaccaacc tggggtcctt 60 

tgttttccac atgcgaagga tttcagagtg gctgtgactc agagttctca gttcagtaag 120 

cctccctatg aggtgggcga aatgctgtgg gtcttctgga tggtacattt ttgaaaactt 180 

aaacaaaagt tgcaggactg gttcttgtag gttttctata tgctgcttat ttttaaggca 240 

tggacgatct gagaaaagca ctgttgtcgc cgtaagcaga gcatattcag tatccgatac 300 

atgaagttca ctcattcttc tgtagaagta agatagtgag gcaataaatt ctttagtaat 360 

atcagttaga ggagccgaag ggtagccttc cttgagaaag ttttccggag aatgaacact 420 

ttcatcagcg ctaggattat gcacctctag tgtcccagct gagggctttg ctggtctcat 480 

agtacttcca gaggttgagt ctttcccacc ataaagctgg gctacatgaa ggaacatcac 540 

ttcagttttt gacgccttct gtaatgcagc ctgatcctca gtggttaaat tttcaaatcc 600 

tgggagtccc ttggtaaact tcattagccc ttgtatgtgc aggactgata cctctgagag 660 

tctcagaaaa cttagttcgg ggttggaacc ctcctgcaga agtttgcttg tttctcccaa 720 

gggaatcatg gatttttggt gagcagtcac tatggtattc agaagccgat gttcctcttg 780 

agttagagtc atgttgtcct gaaccccttt cccagatctg gtgctggatg acacgtgttt 840 

ggtgtctgct ccttcatctt ccacttggat ggcagggtac agggcaggcc cgtgcttgaa 900 

gttcttgcga agtctctttg acttacactg gatctctgtg agcaaacatt ctgccaacat 960 

ccccaccgcc ttacacttct tcagtctgca ctcttggcat tttctgcgca tgtacatgtc 1020 

catttcacag tgaccaccgt tcttgcaaga atacactgca ttcttggtaa tgctacgccg 1080 

gaaaaagcct ttgcacccct cacaagtaag tgcattgtag tgatatcctg atgccttatc 1140 

accgcagacc atgcagagca tgtcatcttg acccttccag cgtaaagaac aagttgctct 1200 

gcacctttgt gccccaggga attcatcttc afccatcgtga acaaccacat aagtggactg 1260 

accagattct cgagcgtcga gtccacacca actgcctcca ctgtactgtg gatccaagcc 1320 

atgtgtgttg aaatgacttt gtaaagatgg ggactgtaac gctggaggaa actgagcggt 1380 

agaatattga caatagggtg gttcttggaa atctgtatca cacagctgat aatggaattg 1440 

ttctggcaaa atatctggtt ttattaacat 1470 
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Fig. 5 

lt66 NC Fragment 

cctggaataa aaaggtccag accaacctat tcttcctcga gaaataaggg acaggaagaa 60 

ttctgtgtag tttgtggtga taaagcatca ccatcaccat atcattataa tgcacttacc 120 

tgtgaaggtt gcaaagcatc aacaaaatgc agtatatagt tgcaggaatg gtagtcactg 180 

tgaaatggac atgtacatgc gtagaaaatg tcaagagtgc agactgaaaa agtataaggc 240 

agtaggaatg ttggcagaat gtttgcacac agaaatccaa tgtaaattaa agagacttca 300 

aaagaacttt aaggagaaga atcattttta ctctaacatc aaagtggaag aggaaggagt 360 

agaccacagt tttctatcat ccaccactag acctggaaaa gtgattcagg aaagcatgga 420 

actaactgaa gaggaacatc agctcattaa taacattgtg gctgctcatc aaaaatatac 480 

cattccttta gaagaaacaa atttctgcag gaacatacaa atcctgaact gagctttttg 540 

caactctcag agacagcagt cctacacata cgtgggctaa tgaattttac caaggggctc 600 

ccaggatttg aaaatttggc caatgaggat caaactgcac tacagaaggg atcaaaaact 660 

gaagtgatat ttctccatgg ggcccaactt tacagtcaga aacaatcagc ctctgaaagt 720 

tctgtgagaa tattaaatca ttcagattat acaccaaatt gtcacaatag gagtggtgat 780 

agaagtctta tttgttctat ggaaaaattt tacaatgaag aatgtccttc tactactcta 840 

attggtttca agaagctcat ctggaaaatg gtgataataa tagaacttac ctcatacagt 900 

attgtgacta ctacataaaa taatacatag agatcgtcca tgccttaaaa ataagcaata 960 

tatggaaaat ttacaagaac cagttttaca aatattgtat aagtattcaa aaatgtatca 1020 

tccagaagac ccatagcatt ttgcccatct catatggaag catactgaac tgagaac 1077 



L66 NC Fragment {Reverse Complement) 

gttctcagtt cagtatgctt ccatatgaga tgggcaaaat gctatgggtc ttctggatga 60 

tacatttttg aatacttata caatatttgt aaaactggtt cttgtaaatt ttccatatat 120 

tgcttatttt taaggcatgg acgatctcta tgtattattt tatgtagtag tcacaatact 180 

gtatgaggta agttctatta ttatcaccat tttccagatg agcttcttga aaccaattag 240 

^™"ag^^ _J^° t 

accactccta ttgtgacaat ttggtgtata atctgaatga tttaatattc tcacagaact" 1S"SG " -~-~~«- 

ttcagaggct gattgtttct gactgtaaag ttgggcccca tggagaaata tcacttcagt 420 

ttttgatccc ttctgtagtg cagtttgatc ctcattggcc aaattttcaa atcctgggag 480 

ccccttggta aaattcatta gcccacgtat gtgtaggact gctgtctctg agagttgcaa 540 

aaagctcagt tcaggatttg tatgttcctg cagaaatttg tttcttctaa aggaatggta 600 

tatttttgat gagcagccac aatgttatta atgagctgat gttcctcttc agttagttcc 660 

atgctfctcct gaatcacttt tccaggtcta gtggtggatg atagaaaact gtggtctact 720 

ccttcctctt ccactttgat gttagagtaa aaatgattct tctccttaaa gttcttttga. 780 

agtctcttta atttacattg gatttctgtg tgcaaacatt ctgccaacat tcctactgcc 840 

ttatactttt tcagtctgca ctcttgacat tttctacgca tgtacatgtc catttcacag 900 

tgactaccat tcctgcaact atatactgca ttttgttgat gctttgcaac cttcacaggt 960 

aagtgcatta taatgatatg gtgatggtga tgctttatca ccacaaacta cacagaattc 1020 

ttcctgtccc ttatttctcg aggaagaata ggttggtctg gaccttttta ttccagg 1077 
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Ii 66 PolyAl Fragment 

cccaacttta cagtcagaaa caatcagcct ctgaaagttc tgtgagaata ttaaatcatt 60 

cagattatac accaaattgt cacaatagga gtggtgatag aagtcttatt tgttctatgg 120 

aaaaatttta caatgaagaa tgtccttcta ctactctaat tggtaatatg actcaatatg 180 

aaatattata ttggatgcaa aaatttgtat ataatgttta actttcttat actgctttga 240 

gatacaatga taatttccat atgtttgatt ctaccctatg tttggatgaa aatacatttt 300 

cgtatcngcc caaaaaaaaa aaaaaaaaaa a 331 



Ii 66 PolyAl Fragment (Reverse Complement) 

tttttttttt tttttttttt gggcngatac gaaaatgtat tttcatccaa acatagggta 60 

gaatcaaaca tatggaaatt atcattgtat ctcaaagcag tataagaaag ttaaacatta 120 

tatacaaatt tttgcatcca atataatatt tcatattgag tcatattacc aattagagta 180 

gtagaaggac attcttcatt gtaaaatttt tccatagaac aaataagact tctatcacca 240 

ctcctattgt gacaatttgg tgtataatct gaatgattta atattctcac agaactttca 300 

gaggctgatt gtttctgact gtaaagttgg g 331 



Fig. 7 

L 66 PolyA2 Fragment 

cccaacttta cagtcagaaa caatcagcct ctgaaagttc tgtgagaata ttaaatcatt 60 

cagattatac accaaattgt cacaatagga gtggtgatag aagtcttatt tgttctatgg 120 

aaaaatttta caatgaagaa tgtccttcta ctactctaat tggtaatatg actcaatatg 180 

aaatattata ttggatgcaa aaatttgtat ataatgttta actttcttat actgctttga 240 

^^Gaa^^aatttccat^ tttggatgag gaaaaaaaaa 300 



L 66 PolyA2 Fragment (Reverse Complement) 

tttttttttt tttttttttt cctcatccaa acatagggta gaatcaaaca tatggaaatt 60 

atcattgtat ctcaaagcag tataagaaag ttaaacatta tatacaaatt tttgcatcca 120 

atataatatt tcatattgag tcatattacc aattagagta gtagaaggac attcttcatt 180 

gtaaaatttt tccatagaac aaataagact tctatcacca ctcctattgt gacaatttgg 240 

tgtataatct gaatgattta atattctcac agaactttca gaggctgatt gtttctgact 300 

gtaaagttgg g 311 



Fig. 8 

L 66 GF Fragment (DNA Sequence) 

ggccaatgag gatcaaactg cactacagaa gggatcaaaa actgaagtga tatttctcca 60 

tggggcccaa ctttacagtc agaaacaatc agcctctgaa agttctgtga gaatattaaa 120 

tcattcagat tatacaccaa attgtcacaa taggagtggt gatagaagtc ttatttgttc 180 

tatggaaaaa ttttacaatg aagaatgtcc ttctactact ctaattgatc gtccatgcct 240 

taaaaataag caatatatgg aaaatttaca agaaccagtt ttacaaatat tgtataagta 300 

ttcaaaaatg tatcatccag aagacccata gcattttgcc catctcatat ggaagcatac 360 

tgaactgaga actctgaatt ataaccattc agaaatactt agcacttgga aaacaaagga 420 

cccca 425 



L 66 GF Fragment. (DNA Sequence Reverse Complement) 

tggggtcctt tgttttccaa gtgctaagta tttctgaatg gttataattc agagttctca 60 

gttcagtatg cttccatatg agatgggcaa aatgctatgg gtcttctgga tgatacattt 120 

ttgaatactt atacaatatt tgtaaaactg gttcttgtaa attttccata tattgcttat 180 
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ttttataggca-tggacgatca-attagagtagte _ 
ccatagaaca aataagactt ctatcaccac tcctattgtg acaatttggt gtataatctg 
aatgatttaa tattctcaca gaactttcag aggctgattg tttctgactg taaagttggg 
ccccatggag aaatatcact tcagtttttg atcccttctg tagtgcagtt tgatcctcat 
tggcc 



300 
360 
420 
425 
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Fig. 9 

L 66 SF Fragment {DMA Sequence) 

oaccaatgag gatcaaactg cactacagaa gggatcaaaa actgaagtga tatttctcca 
ctttacagtc agaaacaatc agcctctgaa aatcgtccat sccttaaaaa 
tiaacaatat atggaaaatt tacaagaacc agttttacaa atattgtata agtattcaaa 
aaScat ccagaagacc catagcattt tgcccatcto atatggaagc atactgaact 
gtgaactctg aatLtaacc attcagaaat acttagcact tggaaaacaa aggacccca 

L 66 SF Fragment (DMA Sequence - Reverse Complement) 

taqqgtcctt tgttttccaa gtgctaagta tttctgaatg gttataattc agagttctca 
g??caSatg Sccatatg Igatgggcaa aatgctatgg gtcttctgga tgatacattt 
Kgaa?act? atacaatatt tgtaaaactg gttcttgtaa attttccaW Jattgcttat 
ttttaaagca tggacgattt tcagaggctg attgtttctg actgtaaagt tgggccccat 
gg agaaa?a? Sttcagtt tttgatccct tctgtagtgc agtttgatcc tcattggcc 



60 
120 
180 
240 
299 



60 
120 
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240 
299 
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Fig. 11: 
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Fig. 12: 

ctttggggtc atagcattta ttaggtattg ttagttccat ttactgtgtt cctattgtgc 60 

tggacggcaa gtcctgagat ggaatagatg tgttgaacta actttctgtt gagtataaaa 120 

cattttgagg tttaaaacaa ttatgaactt ttttgacctt ttatgatttg ttctttttta 180 

tgttaataaa accagatatt ttgccagaac aattccatta tcagctgtgt gatacagatt 240 

tccaagaacc accctattgt caatattcta ccgctcagtt tcctccagcg ttacagtccc 300 

catctttaca aagtcatttc aacacacatg gcttggatcc acagtacagt ggaggcagtt 360 

ggtgtggact cgacgctcga gaatctggtc agtccactta tgtggttgtt cacgatgatg 420 

aagatgaatt ccctggggca caaaggtgca gagcaacttg ttctttacgc tggaagggtc 480 

aagatgacat gctctgcatg gtctgcggtg ataaggcatc aggatatcac tacaatgcac 540 

ttacttgtga ggggtgcaaa ggtaagggta aggcgcaacc aaacagacag tgctgcgcac 600 

gcgcacacgt gtgtatgtgt gtgtgcacgt gcgtgtgcaa tcgttataaa acatgagaaa 660 

caaaaccagg aaaaagtatg tctaaaggga tcaatctgtg tgatcaaaga taggagaata 720 

aaatgttaag tttaaggtta gtttctttcc tttcctttcc tttcctttcc tttcctttcc 780 

tttcctttcc ctttcccttt ccctttccct ttcccttcct ctttctttct ttctttcttt 840 

ctttctttct ttctttcttt ctttctttct ttctttcttt ctttctttct ctctctctct 900 

ctctctctct ctctctctct ctctctctct ctctctctct ctccctctct ctctctctct 960 

ctctctctct ctctctctct ctctctctct ctctctctct ctctctctct ctctctccct 1020 

ccctcccccc ctctctctct tttgaagtag agtatcattg tatagccctg gctggcctag 1080 

acctcactat ggagatcagg ttggctttga actcacagat acctgcctgt ctctgctttc 1140 

taggtgctgg gattaaaggt gtgggccacc atgcctggac catttctttc tttatttttt 1200 

tttctcctcc cttgtcttcc ccttcgtcct catcttctcc tcctcatctt tcttcttttt 1260 

ttgagaaaag gtctctctat gcatccctgg ctgtcctgga actcactgtt tagaccaggc 1320 

tgtccttgaa ctctcagaga tccacctacc tttgtatcct tactgctggc attaagagca 1380 

ttcactacca ggtctgtctc ttcatttttg atgaagactt tcacacttta ctttaaactc 1440 

gatacatata agcaaacgtt ttccctattt ctagaagaat ttgatgctgg aacaaaacat 1500 

gacaatgtaa agatctgtaa accatgacac ctttttatgt ggttgataaa atttatacac 1560 

ttggagaaaa tttgaaataa atgacttcta tgattctaag tcaataaatt ttaaaaagtg 1620 

tttcgtatag aaagagaaca agctgaaagt cgggaagcaa gggccagtgt ttgcaatgat 1680 

atttgtaaag agcggaagcc tctctgataa ataacagttc tgaaacagtc taccactcca 1740 

ttggatttga ttctagcacc ctctgaaaca aaacaaaaca acccaaacaa cagaaaacag *1860~ 

tttagtcaat gtaacacaaa taaacgtgtt tgtttagggt aaaattttac ctgtaaatct 1920 

taaacctcaa cactggcaaa gaatagccaa aggatatgag gtgacagagt cctgtggcga 1980 

gtagttaggc aggggtggcc tgctcttgac ggtgacgggt tactgttaaa actgagctga 2040 

agaacccagc acagttttga aaacctgagc tgagcttttt cttccttcgt tgaactgaaa 2100 

tggcgaaact ccatctttca agtgtccact gcaagagagt aagtggtggg gctttgggac 2160 

aactcatagg agggagagca ggaaatggca ggtattactg tgtaataagg ttcgaaggag 2220 

gaaaagggat gggaactatc tggcaagtgc tgggaaaggg agagagagca tctgcggatg 2280 

agggagggag ggaccacctt tgtgggctgc aactcagact gggagggttt agaaaagaga 2340 

ggcttttctg aattttttct ttattttagg tgaccatgta ttctctcaaa ggaaactcga 2400 

gagatgcatt gtaggatctt gatattttgt ataatgcagt taaaaaaaaa attcttatag 2460 

gtaatttacc tatagtaggg tagagagttg tcttaatgtg ataatgatac tccatttaat 2520 

ctccagacgt tataaagaaa gacccagcct caggatgtca actgaactat actaactata 2580 

aagaattata gaattagcta actttccaga taaatgtata tttaaaaaaa tgtcgggaat 2640 

aaaaaaaacc ctatttaatt gaaatactct agttagcttg attactttct ataaaggtaa 2700 

aatgttaata aaaatgagaa gtaactaaac atatccatgt caaactttga ccatctttgt 2760 

tgttaggctt tttccggcgt agcattacca agaatgcagt gtattcttgc aagaacggtg 2820 

gtcactgtga aatggacatg tacatgcgca gaaaatgcca agagtgcaga ctgaagaagt 2880 

gtaaggcggt ggggatgttg gcagaatgta agtgccagtg tgttcgtggc ttgcttttct 2940 

gaggggattt aacttttccc agccttacct tcctgacaag gaggacatga tagtggtggt 3000 

ggtttttcca gggcaagaat cgcccattgc actagggatg tgctgactga cccctgtgat 3060 

ttccacaaat gtgggaaact caggtgcatc atttgtgtta gtgaaggact gtattcacac 3120 

tctcctctat ataataaaaa tgaatttaaa aattaataaa aaattaaaaa tttccaacct 3180 

tcttgaggac ataaagcatt cttcaagagt tattttgaac atggccgtat tcgtttgtat 3240 

atgcacacca cacacatatg tacagtaatt gcgccacgtg gatggcccat tatcaattta 3300 

agtagtgatt cccataaatg ttgccctgtt ctatattgct ctaattttga aatcagtcat 3360 

agtttcatct taattagatg ttctctgaga cctgaagagt agttaggcag ttaagagcat 3420 

taggagagga cctgggcccg gtccccagct tccacgtggc ccctcacaat catttataac 3480 

tcagttcctg agcatcccag gccctcttct gacctctgca ggtatgagtg ttgtgtgcat 3540 



SUBSTITUTE SHEET (RULE 26) 



4560 
4620 
4680 
4740 



WO 02/22817 ' PCT/EP01/10323 

13/37 
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acacacatgc aagcaaaaca cacacataaa ataaaagaaa cctccatagt gatttaaaaa 3600 

aggtattgta tgtttctgtt acttaaccta aaaatatata ctgccttgaa aagtgcaaag 3660 

ttaaagttag tagttcccaa aaagaaagaa agaggaggag aaagaagaga aaaggaagtc 3720 

ttccctttct gagaaggggt tctcagagtc ctaacacact acagtcttag gacaattgaa 3780 

gggtaactgg atctctggac ccctgacatc ctcatatgta aagtccacta atagtctcac 3840 

gagtgtcagg actcatctct gaatatcaac acattagcat atgagcagtg ggagatacaa 3900 

ctcagtccaa agaaaccttt gaaaaagttg tcttttattt tattcattaa tttaatttgt 3960 

attcaggatc tcactatgta gctgacctgg aacttgctat gtacgccagg ctggcatcaa 4020 

actcagagat ccccctgcct ctgccctcca atttctggct caaaagttgt cttttaacct 4080 

gtggcaggca atgcatgtga aagggaagtt ggagtcctct gaggaactct atgcataaaa 4140 

tgagtcatcc tacacttaaa gaaacttaaa gaaatbcbgg caaggtgaag ctcgagagag 4200 

ttggtgagca gtcacggtga ttggcgagca taagctggtc cgcaccctgc caccabbgcb 4260 

tggatctgca cacttcattt acctttcagt tgtcctaata ctgccatgtg ttagtgctct 4320 

gacggcccct cctcctcttc cttccctcat tctcctttcc cbbbccbccb ttcttttcct 4380 

cttcctccfcc ttccccctct tcttcctcct tttcctcctc bbcgbbcbcb tcttgtggtt 4440 

ttccttctca tggtcctggg ggttgagttc aggtttctca catcctagac aatctaccac 4500 
taagctacac ccatagactc ctttcctact aaaaaacact gagttgccat ggctggtatt 
caacgtgaga tcatcttgta acagtgtctt aagtagctgg ggbbabaggb gtttactacc 
abgbbcaata cbcaababgc ttaaaatggc tttatttttt tgtgtattct gttctctgac 
aaataataaa ataatataga ggggttttaa ggbbbbbgbb tatacttcag ctcctcatta 

gbbbbbbabb ttataggtgt gtttttaaaa atactttgaa aacgacaggt atattaatag 4800 

ttggatactt ttagtagctg ctaaaagtac ttagagtagt ccagtccagt ccagtacata 4860 

caetacdgct cacaaoeagc tatagtctca cttcaagggg cbabgacccc tctgtgctct 4920 

gtgggcacct gcactcacct gtgcatttcc ccatacagac tcatgcaagt aattataaaa 4980 

tatttaaaaa -caaaGaaaaa caatcctaaa ataaatatta bbtgccbbbg aaaaagaabb 5040 

aaatgctttg bcbgacabag gaaaaattga cttagcttat bbbbaaaaab cctaagatag. 5100 

tcttttttaa ccttattaab aaaaccagtg afcaatagtca tagataatgt aabbcbacba 5160 

ttctatcaaa tgactttttc aattccaaga bgcaacbaab gtgtaaatct cbbabgbaaa 5220 

aatctaaaca taatttggaa abgaaaaabb aattcttaat cbbbgbbaba acbabbbbba 5280 

agcagacaag gtgtctatga agggaaaagc agaabaggaa abbbaataga tgcagacatg 5340 

cacqaqagat abcbaggacb atttactcac tgaagaocat tttaatcaaa ccctacagtg 5400 

agatccagtg taagtcaaag agacttcgca agaacttoaa gcacgggcct gccctgtacc 5520 

ctgccatcca agtggaagat gaaggagcag aeaccaaaca cgtgtcatcc agcaccagat 5580 

ctgggaaagg ggcaagacgc ttcttccata cagbcbctbg tgtctttatg aacagbgfcgb 5640 

aacaagcaca cattacaggc gcdctctgtt bccbbafeagg ttcaggacaa catgactcta 5700 

acbcaagagg aacatcggct tctgaatacc atagtgactg cbcaccaaaa atccatgatt 5760 

cccttgggag aaacaagcaa acttgtatgt atccattgct ggcaagaaat aatttgtaat 5820 

gaabgatabt aggaaabgab agcaagaaca gtaaaaaaga aacaccttag taattatcag 5880 

aaagacabae tatgaatcat ttttatttat ttattttttt babggaccag ctgcaggagg 5940 

gttccaaccc cgaacbaagb tttctgagac tctcagaggt atcagtcctg cacatacaag 6000 

ggctaatgaa gtttaccaag ggadtcccag gtaaaattct gtttcattat tttttaagga 6060 

bbbabbbabb tttafegtttsa tgtgttatgc atgttttgat tgcaattata babbcababc 6120 

atgtacacaa catattgtgg aggccagaag agagtgtttg gtcccctaga aotagagttt 6180 

-ccbaccgabg gctgtgaatt accatgtggg cgttggaacc agagtctctg caagagcaag 6240 

acaggctctt aactgcfcgat ctgtctttct agcccccaat ttttaaaaaa gatbafcttat 6300 

ttatttattt atttattbab bbabbbabbb atbtatctga gbacabbgba gcbgbacaga 6360 

bggtbgtgag ccbbcabgbg gbbgbbgaga abbgaattbb baggaccbcb gcbcacbcbg 6420 

gbcggccccg ctcgtbcbgg cccaaagabb tcabaagtac acbgbagcbg bcbbcaggtg 6480 

caccagaaga gftgcgbbaga bcbcgbbacg gabggbtgcg agccaccgtg bggtbgcbgg 6540 

gactbgaact cagggcbbbb ba^gaagagb agbcagbgcb cbbacccacb gagctabcbc 6600 

bccagcccca aabbbbcbtb btbbbaaagg gabagbabab caaacagaaa bbbabagbbg 6660 

bggtacbaga bcbabcgcbc agaggcagag caccbgbcba abgbgbgbga agcccbgggb 6720 

bbbcbagcaa bgbbgcbbcb aacbaaaaaa aaaaaaaabg bbbabbbgbc bbbgbcaabb. 6780 

aabcagabga gbgacaabcb aaababbbac aaaaabbabb ataabgabgb gggcbbbcbb 6840 

agabgaaaca agaaacaaag aaggbbbccb cbabaatacb aaacaagbca cagbbbgacc 6900 

aaaacbbaba babaacbcag cbbbabbaba bgebbagagg bacagaaacb cbbbacaaaa 6960 

bbacabbcbb caaggcbaab gcagcbabgb bgabbbgbcc bbgcbgbaag aaagbgbaac 7020 

cbagbgcbga baaaagtgcJc ccaaaabaga aaaaaagaab bbbabbcaga bbbcaabbba 7080 

babactabbg bbabbcccca gaagcacbba gggbbaabaa gbaaababbb bbaaacabbb 7140 
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Fig. 12continu d: 



aatttatgaa 
atttattttt 
tcaaaaactg 
tctggaagta 
cagacagaca 
cgcacacact 
gaactcagta 
accatcagcc 
tgtgctcagt 
attttatttt 
tcacattcat 
tcttgtgaga 
ttccttccca 
ccctgctcct 
ggggtggggg 

gagactggat 
ctgtgggtgg 
gtgatcaaac 
atgcttgctt 
tgaagcccta 
agcaagtatt 
ttagtgtgtg 
gaggcaatag 
actcgttaca 
ctagaccaga 
tgtttgctta 
gtttaagaaa 
ctgtacaatc 
aacaaaaatg 

OT ^ggatac£ga_ 

tttgtgtttt 
agaggtgcat 
aggctaccct 
ggacacgctt 
cacagctcac 
ttagtgttgg 
atcatggcaa 
aggatgatga 
gtaagcctta 
tcagttccac 
ttcagtagca 
caaaggggat 
tcaattttct 
- tccattccga 
cctcactatc 
ctctgcttac 
tgatctctgg 
taccactctc 
gttacatctt 
cgctcccagt 
ttactcatgc 
ctacaacctg 
cttcatctta 
aagaagggga 
tgtctgatgt 
tatgcgttac 
tgctcaagcc 
catgtttgtg 
aggtatgtgc 
cttttattat 



cgttctcttt tgattatctg tttctttatg 
aaggatttga aa&tttaacc actgaggatc 
aagtgatgtt ccttcatgta gcccagcttt 
aaaagaaaaa cagaatttta ttcaacttca 
gacacacaca cacacacaca cacacacaca 
tttaaacact gttttttgga aatagactct 
tgtagctaag gaggatctta aagccctgct 
tagagcctct atgcctgcct tctatggtgc 
aagccagaac tcctaactcc taacatcatc 
accagttagt gtaggaacaa atgtctcata 
ctagcatttt ctctctaccg tgtaattctt 
actgtataca tgtcctaagt gttttttttt 
ccagctcctt tagaatcccc cacacacctc 
tctttttcaa aataacccat tgactctaaa 
tagggtcatc cactgaagca tggccaactt 
tgtccttccc ctagcagctg ccaacagctt 
ggcttctgtg ctccttagac aggctcttcc 
ataaaggcgt tggtgattct aagacacaaa 
tgaaactcat aaccacgggc tactgtttgt 
ttaggttagt gttgaaccaa ctatgcagga 
ttttaaaaaa tcaataaacc atttgaatag 
aatgcaagag tccaatctgt aaagtcgcat 
gacaaaatca aaaccaaaag ccaacaatgc 
gtgcacagcc ttttaaaata tccttattta 
aggctctaaa gggaggccct tgatgacctg 
aaagtctcca cagcctctgg tgcactgagt 
gccattctta gtttctattt cattatgcaa 
tttttttttt ttttacaaaa tatgaactaa 
agaatgatat cattttagga gttttttttt 
aaa^atactg aatttactat ttatataatg 
tctcttttgc taggtactat gagaccagca 
aatcctagcg ctgatgaaag tgttcattct 
tcggctcctc taactggtaa cactgtgcag 
ccttttctcc ccttttcatt gcactacaaa 
aggcgtctct gtggtggtga aagcatttat 
caggtcatca gtgagaattc tatttgtaac 
ttctttccct atttaattgc gaagccattt 
tgtaatctac agccaacatc ccagtatcaa 
ctgtagataa tagagataca cggagagtgg 
agcctcctcc ttggtccacg tgatgacaac 
cacctaatgc gcccactctt gaggttcagg 
tgtctgtgcc ggccatcaca tcggtagcca 
tacactgttt gagagaagaa aagacaatca 
tgaaaattca cttatattaa tcttttcaga 
ttacttctac agaagaatga gtgaacttca 
ggcgacaaca gtgcttttct caggtacaga 
acaggcaatc actaagttct ttttatcgaa 
atcccatcat gctttgctct ctatgtgcat 
tcttgcatct taaagaaatg ctttcttgcc 
ccctgataga gagaggttct cagggagtta 
tcactcctga gttcaccaca gtcacatttc 
tcaggatctt caggacatgc tatcactgac 
cttgccccac tggtgtattt tgtagagagt 
tatgttgttt cctgtgagcc cttcagtgga 
gttaggagaa tttctcatgc ttctttatgt 
aaaccggacc caggttgaag tgagacagtt 
agcgacgtct tcctagggga ccattgtcct 
ttcttttata ttctaacttg ctgtgatatc 
atctgaagtt gaatcacata ggctccagag 
gctttattcc aagaagtttt ggaagcaact 



ctttatttat ttatttattt 
aggctgcatt acagaaggcg 
atggtgggaa agactcaacc 
agtacacaca gacagacaga 
cacacacacg cacgcacgca 
catgtagtcc aggctatgta 
gtgtctgctc ttacctgcca 
tggggacaga agccagggtc 
agcctttcat ttatcttttc 
gtagctgttt ataaatagtt 
ttaaaaattg tatttcttat 
ttaaatctta ctcacctcac 
tctcctaact tcatgtcctt 
tagtgctgct catatacacg 
agagggctca gttccctaaa 
ctcagctgtg ggtggggctt 
aatgaggagg acttttatgg 
gcagctctgg ggacggccgc 
ggaacgtctc tttactgggg 
cctcacttca ataaaaaaaa 
agcaggaaag ccatcattgt 
gttggttgag gacatgggaa 
tagctgcctt cactagctgt 
ttgattgttc tctgcaaaca 
gtccatgatt gcttgagtag 
ggttcgtgga gggtgtgtgt 
aactcatttt ggagggtctt 
tatctgtttt gtgttccagt 
cccccacata atagttatgc 
attaagaaag actatcaaca 
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a^c^ciTc^^gggai^iTt^^9G<>^ 
ccggaaaact ttctcaagga 
taggaagcat caccttgggt 
gagaaggcca tgggctttct 
agacacagcc aatacagttg 
cattttctta ctttggcttc 
ggggcctctg aattattctt 
ccacttttat gggtagaagg 
cttctgtagg ctaaaggctc 
ctctctttgt tctggtcctt 
cctgggcttc aagggcacaa 
tcttgaaccc cactctcttt 
caatgtgttt gcttctgtgc 
tattactaaa gaatttattg 
tgtatcggat actgaatatg 
ctgagaagta aggctcggag 
ctatgaactg aagttgaatg 
tcctttaagc ttaccactgg 
ctcatagccc ctcagatata 
gctatattct agtttccacc 
actccaccct ctccagtcca 
tcaactggtc aatgctctgt 
atcataagca tctgaaagaa 
attttgaagc acggtgaaga 
cagttcacag tcttcttgac 
acttagcatg ggtaacccat 
gtgtccttca ccgattcggc 
agggttcgtg aatgttcact 
cagctccatt atattgtcca 
gtgtgacaaa caggataaag 
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Fig. 12 continu d: 



gatgrtagaag gctggaggga cagcagtgtt ggctatcatc 
atcagctcag actgttacct agggacaggt cactctggtt 
ctttcagttc agtgtaggga attaagctac aaattctgga 
ctggtgttct ggctgtgtga ggctttcccc atctctcagg 
tggccatcat cctttatcct tcttggtggg tcttttaaaa 
ctacatttaa gaatttcaat ggggccacag gcgcccctag 
atatgttcat cacatccctt cagctgatct ttgggtttgc 
aacaatttac cttttcataa tcttgccatt tctacttact 
gttctccttg tagactgcag tccacctaac aacttgcctc 
caagcagaac taaaatcatg caaagaattg cagaagcatg 
ctccaaaaaa accccaaaac aaaaacaaaa acaaaaacaa 
tcactaactc acataggtgg tttgtttcct tttgttggac 
atctatctat ctatctatct atctatctat ctatctatct 
tactatctat ctatctatct atctatctat ctatctatct 
atctacctac ctacctatct atctatctat ctatctatct 
atctatctat ctatctatct atctatctat ctatctatct 
tacatgtcat gcttgtggag gtcagaggac aacctatggg 
catgtgtgtt ctggggttca ggtagtcagg ctgggtgata 
tcccactggc agctttgttg tttcacggaa tctctggaac 
gttcattggc tgttctttct gtgccttctt tattctgtca 
aaatatttaa gttgtcctag gacccagttt tcttttaaat 
atgtttcgcc tgcatctatg caccatggtg tgcctcacgc 
aggtcccttg ggacttgact tacagaggtt gtgacaaggc 
cttccttcct gattccatcc taactcttgg cattaaaggc 
cagacgtatc tgtgtgctcc cccagaatgt attaaatagc 
accgtcaaga aggtgtatca aatttaaaat gcccaaagct 
ccttagcttt tggtattcct cccccatctg gaccagacca 
catgtgcaac tttctttttt aatccttagt ttcttttctt 
atgcacccgt tctgtttatt caaccctaag atatacccag 



tttatagact ctctctgtat tagtacacag aggggttccc 
tggtatgcct ctggaactgt cgatggcttc ctagtttgca 
gggggggggg ggctggagag atggctcagt ggttaagagc 
tcctgagttc aattcccagc aaccatatgg tggctcacaa 
ctccatcttc tggagtgtct gaagacagct acagtgtact 
aaatcttaaa aaaaaaaacc aaaggtcctt ctattggtct 
catcctcgcc catccacatt gctttgacac aggtttccct 
acatgctctg ccatttttga tgcatctttt gccccctttc 
catcttacac tgttttgcca ttttatctac ttgttttttt 
ggttaaatgt tgttgtatcc ctgattaatc tttacaaata 
gtcttattta tttctgatta ctttttttaa tggcacatga 
taaaaaaacc tttttcattc attgataaaa agcacttact 
ctttgtctaa atgtgcaccc cttgtgtgct gtgagtccac 
ggaactggag ttacagatgg ccgtgagcca ccatgtgggt 
cttgggaaga gcagccagtg ctcttagctg ttgataggtc 
ctgaagatgt gtgtaagtct ctccctctcc acttcttcca 
taaagggaca caatcttgcc tgtgagtttc attgtggtat 
gaatgaaggc tgcctaagcc ttcggaactt cctggttagc 
caagaagtct cacgctttac ctctgttgac tgtagagagg 
gtttcaggca agcgtttatg gctgttatat ccccaacctt 
ccatgtgact ttatggaaat tacttaactc cctacatttc 
atgtgataaa aatagaatgt agctcaatag aatatagtga 
caaaaagaat ttagttagaa aacctaatgt gtattagggt 
tataactctg ttaataactc attgttttaa gtgcattcat 
attattaaga taaatatgat gtttttcaga tcgtccatgc 
agaaaaccta caagaaccag tcctgcaact tttgtttaag 
agaagaccca cagcatttcg cccacctcat agggaggctt 
tcacagccac tctgaaatcc ttcgcatgtg gaaaacaaag 
attctctgag aaatgggatc tgcactcatt ttcctgaaaa 
cttaaaattt caaaatcaaa attggatgaa tgatttgaaa 



tgaatctgaa 
ggaccataaa 
tggtgcagta 
agtgggactg 
caaggactgt 
attataaagt 
agggctaata 
gtttagatgg 
cagccttgtt 
gcagtctcag 
aaacaaaaaa 
atttacaatt 
atctatctat 
atctatctat 
atctatctat 
atccatccac 
aattgtttct 
gcatcttccc 
atcaccctct 
tcttcgtctc 
tattttatga 
tgctgagttc 
ccagttttct 
ctgtgtggat 
ctcctcaagt 
ggactcttga 
tagcttcgtc 
tctcttatat 
cacttaaccg 
^tgcgttttct 



tatttggaaa 
actcttcttt 
ccttagagcc 
ataacgcctg 
gtcatgctct 
tctcataaag 
tttgattccc 
aaaagagtaa 
aacagtacca 
gaagtttcca 
ccccacccac 
ctatctatct 
ctatctatcc 
ctatctatct 
cctactatct 
tgtgtgtgtg 
ctctttccac 
tgctgggcta 
taatttctct 
tcacaatgac 
ttttatgcac 
agaagatatc 
tctgcactgg 
tcgatactcc 
tttccatgta 
cacctttata 
ttccaaattg 
tcatatcaca 
cttcctaccg 
gactctatgc 



tggaaatata 
tagattaaaa 
agtgactgct 
ccatctgtaa 
cacatataat 
agtttgtctt 
tgatctcgca 
cttggatgcc 
cttttacctt 
gctaccaccc 
ttattttgtt 
ttccttgatg 
aagaaggtgt 
gtggaaatca 
tttccagact 
cattaagaat 
agttagcaag 
tcatttctta 
ttacttacta 
actcaccgat 
agggagctaa 
agattaagta 
tctgcaagta 
tattcccacc 
cttaaaaata 
ttttcaaaaa 
actgaactga 
gaccccaggt 
tttacagtgg 
aaaatgtgat 



agtcagaFcpT 

ccaaaggtcc 

cttccaaagg 

tgagatctga 

aaaaataaat 

gctccttttt 

ggtttctgct 

cacatgatgc 

tttgggtttg 

ccccccaaga 

tttgcttctt 

tgtatgtctc 

catatcccct 

aaccccagta 

gtttttactt 

ataaggacaa 

acgactggat 

cgtaggaacc 

ctattttaaa 

ccctttcttc 

tctgaaaaaa 

aagtcatgaa 

cttatgattt 

actttgaatc 

agcagcatat 

tgtaccatcc 

gaactctgag 

tggtgatgtt 

tgagacttgc 

aactcaaaat 
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Fig. 12 continued: 

cattgagtta tcacacacca ccttaaaatg gagatccaaa aggctcatca gggtttgtgg 14400 

aactttcaac tcttagaaga tgaacactcc aggattcatc tcaagattgg atgcaaagga 14460 

aagaaaaatg gagccatgtg gtggcatagg ctctgtagtt gagagacaga gacaagagga 14520 

ccccatgttt aaggcagtcc tgggctatgt agtgagacct catctcagaa gaaaaaagag 14580 

aagaactgat ggccatcatt atgtctccaa tcctcaccta aagaaactaa ttttttgttg 14640 

ttgttgttta gctgaaacac ataatgtgta aatgtgaaca caggtaggtt atggttagga 14700 

taaggcagtt tgtatgcttc actggttaag gacacatgga tggtgtagca gctaatgctt 14760 

agaacctctt cttcatgaat gattaacgag gaacaagcct tttcaaattg tatctgcttt 14820 

atatttttaa taagtatgtg ataattgtac agaatagtca atattgtaca gaatatttca 14880 

atatgggaat ttcatgtgta tatattagat atttctggcg, tatttgccac ccaccctgtg 14940 

atgcccacac tgccaatgcc cttctcatcc ccctaacatt tcccttcctc tgccttgccc 15000 

taaaacacca gattcatacg tgcaagaaaa cattttttaa acaaatttta ttagatattt 15060 

tcttcattta caattcaaat gctatcccaa aagtccctta ttccctcccc ttatcctgct 15120 

ctccaaccca cccacttctg cttcctggcc ctggtattcc cctgtattgg ggtatataat 151B0 

ctttgcaaga ccaagggcct ctcctcccaa cgatggtcca ctaggccatc tgcaagaaaa 15240 

catttgattc cagtctgagt ctgttttatt tcctttgagt atgactatct cgtcaatatc 15300 

tgcaaatgac ataatttcat tcctcttaag gatggaaagg actggattgt gtatgtatat 15360 

ctgtatatat tattttattt atccacctgc tgatgacacc tagtttgatt ctgtatcttg 15420 

gccactgtga atcgtgctat aataaaccta tgtaggtttt tagattttat gtggctcttg 15480 

attctttcag gtgtactccc agaagtggta tggtttatat ggtagctcta tttttagatt 15540 

tgaagggaac tatgtagtga ttttcacagt gattaccttc ttctcaatag tgtataaaga 15600 

ttaatttgcc ttcaaaattc ttgccaattt ttgtttttcc ctttcctttt ctttttaatt 15660 

aaaaaagtgt gtgtgtgtgt gcgtgcatgc caaagaacat tgtggacatc ggaggacagc 15720 

tctgtggaat acattctctc cttccacctt tatgggtggt ctcctctgac cccctctgcc 15780 

atggcctgca tggcttggtt ctggaagcct gattggggtg atgagcgaat gaagaaatga 15840 

cagacatgaa gacacaagta cagaaaagct gagattgggt ggtgtctaag tttctttttt 15900 

attgctataa agagacacca ggaccttatg aaagaaagca ttaaattggg ggcttgctta 15960 

cagtcttaaa gattgcgtcc atgatgatca tggcaggtag catggcacca ggcatgcact 16020 

ggaacagtat ctgaagagct tacattctga tcggcaagtt ggaggcaggg ggtgggggga 16080 

. gaagaaa^tg gctgatgggg cctgggacag gatttggaaa tctcaaagcc catccctagt 16140 
gacacacctc ctccaacgag gccacaccca gtcctscctca aat&gyLeea c^sgfe^s^— 
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Fig. 13: 

ccctacttgg tggacctatt tgaggaggac tgggtgtggc ctcgttggag gaggtgtgtc 60 

actagggatg ggctttgaga tttccaaatc ctgtcccagg ccccatcagc cactttcttc 120 

tccccccacc ccctgcctcc aacttgccga tcagaatgta agctcttcag atactgttcc 180 

agtgcatgcc tggtgccatg ctacctgcca tgatcatcat ggacgcaatc tttaagactg 240 

taagcaagcc cccaatttaa tgctttcttt cataaggtcc tggtgtctct ttatagcaat 300 

aaaaaagaaa cttagacacc acccaatctc agcttttctg tacttgtgtc ttcatgtctg 360 

tcatttcttc attcgctcat caccccaatc aggcttccag aaccaagcca tgcaggccat 420 

ggcagagggg gtcagaggag accacccata aaggtggaag gagagaatgt attccacaga 480 

gctgtcctcc gatgtccaca atgttctttg gcatgcacgc acacacacac acactttttt 540 

aattaaaaag aaaaggaaag ggaaaaacaa aaattggcaa gaattttgaa ggcaaattaa 600 

tctttataca ctattgagaa gaaggtaatc actgtgaaaa tcactacata gttcccttca 660 

aatctaaaaa tagagctacc atataaacca taccacttct gggagtacac ctgaaagaat 720 

caagagccac ataaaatcta aaaacctaca taggtttatt atagcacgat tcacagtggc 780 

caagatacag aatcaaacta ggtgtcatca gcaggtggat aaataaaata atatatacag 840 

atatacatac acaatccagt cctttccatc cttaagagga atgaaattat gtcatttgca 900 

gatattgacg agatagtcat actcaaagga aataaaacag actcagactg gaatcaaatg 960 

ttttcttgca gatggcctag tggaccatcg ttgggaggag aggcccttgg tcttgcaaag 1020 

attatatacc ccaatacagg ggaataccag ggccaggaag. cagaagtggg tgggttggag 1080 

agcaggataa ggggagggaa taagggactt ttgggatagc atttgaattg taaatgaaga 1140 

aaatatctaa taaaatttgt ttaaaaaatg ttttcttgca cgtatgaatc tggtgtttta 1200 

gggcaaggca gaggaaggga aatgttaggg ggatgagaag ggcattggca gtgtgggcat 1260 

cacagggtgg gtggcaaata cgccagaaat atctaatata tacacatgaa attcccatat 1320 

tgaaatattc tgtacaatat tgactattct gtacaattat cacatactta ttaaaaatat 1380 

aaagcagata caatttgaaa aggcttgttc ctcgttaatc attcatgaag aagaggttct 1440 

aagcattagc tgctacacca tccatgtgtc cttaaccagt gaagcataca aactgcctta 1500 

tcctaaccat aacctacctg tgttcacatt tacacattat gtgtttcagc taaacaacaa 1560 

caacaaaaaa ttagtttctt taggtgagga ttggagacat aatgatggcc atcagttctt 1620 

^^fefc^t^^ ggactgcctt a ^ c ^9^ 168 ° 



tcctcttgtc tctgtctctc aactacagag cctatgccac cacatggctc^catttttctt; x/4G 

tcctttgcat ccaatcttga gatgaatcct ggagtgttca tcttctaaga gttgaaagtt 1800 

ccacaaaccc tgatgagcct tttggatctc cattttaagg tggtgtgtga taactcaatg 1860 

attttgagtt atcacatttt tttcaaatca ttcatccaat tttgattttg aaattttaag 1920 

gcaagtctca ccactgtaaa ttttcaggaa aatgagtgca gatcccattt ctcagagaat 1980 

aacatcacca acctggggtc ctttgttttc cacatgcgaa ggatttcaga gtggctgtga 2040 

ctcagagttc tcagttcagt aagcctccct atgaggtggg cgaaatgctg tgggtcttct 2100 

ggatggtaca tttttgaaaa cttaaacaaa agttgcagga ctggttcttg taggttttct 2160 

atatgctgct tatttttaag gcatggacga tctgaaaaac atcatattta tcttaataat 2220 

gattcaaagt ggtgggaata atgaatgcac ttaaaacaat gagttattaa cagagttata 2280 

aaatcataag tacttgcaga accctaatac acattaggtt ttctaactaa attctttttg 2340 

ttcatgactt tacttaatct tcactatatt ctattgagct acattctatt tttatcacat 2400 

ttttttcaga ttagctccct gaaatgtagg gagttaagta atttccataa agtcacatgg 2460 

~ gaagaaaggg atcggtgagt aaggttgggg atataacagc cataaacgct tgcctgaaac 2520 

tttaaaatag tagtaagtaa cctctctaca gtcaacagag gtaaagcgtg agacttcttg 2580 

ggttcctacg taagaaatga gctaaccagg aagttccgaa ggcttaggca gccttcattc 2640 

atccagtcgt cttgctaact ataccacaat gaaactcaca ggcaagattg tgtcccttta 2700 

ttgtccttat attcttaatg tggaagaagt ggagagggag agacttacac acatcttcag 2760 

aagtaaaaac agtctggaaa gacctatcaa cagctaagag cactggctgc tcttcccaag 2820 

tactggggtt tgatttccac acccacatgg tggctcacgg ccatctgtaa ctccagttcc 2880 

aggggatatg acaccttctt gtggactcac agcacacaag gggtgcacat ttagacaaag 2940 

gagacataca catcaaggaa agtaagtgct ttttatcaat gaatgaaaaa ggttttttta 3000 

aagaagcaaa aacaaaataa tcatgtgcca ttaaaaaaag taatcagaaa taaataagac 3060 

tcttgggggg gggtggtagc tatttgtaaa gat-taatcag ggatacaaca acatttaacc 3120 

caaacccaaa aaggtaaaag aaaaaaacaa gtagataaaa tggcaaaaca gtgtaagatg 3180 

gcatcatgtg ggcatccaag gaaagggggc aaaagatgca tcaaaaatgg cagagcatgt 3240 

agcagaaacc tgcgagatca ag&gaaacct gtgtcaaagc aatgtggatg ggcgaggatg 3300 

aaaaaggagc aagacaaact agaccaatag aaggaccttt ggtttttttt tttaagattt 3360 

atttattttt attatatgtg agtacactgt agctgtcttc agacactcca gaagatggag 3420 

tcagatctca ttacagatgg ttgtgagcca ccatatggtt gctgggaatt gaactcagga 3480 
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cctttggaag agcagtcact gctcttaacc actgagccat ctctccagcc cccccccccc 3540 



3600 



ggacctttgg ttttaatcta tgcaaactag gaagccatcg acagttccag aggcatacca 

tcatctgact tatatttcca gggaacccct ctgtgtacta atacagagag agtctataaa 3660 

gcatagagtc agaaaatcca gttagaaaag tggattcacc tacgctgccc aagctataca 3720 

cggtaggaag cggttaagtg ctgggtatat cttagggttg aataaacaga acgggtgcat 3780 

tgtgatatga atataagaga aagaaaagaa actaaggatt aaaaaagaaa gttgcacatg 3840 

caatttggaa gacgaagcta tggtctggtc cagatggggg aggaatacca aaagctaagg 3900 

tataaaggtg tcaagagtcc agctttgggc attttaaatt tgatacacct tcttgacggt 3960 

tacatggaaa acttgaggag gctatttaat acattctggg ggagcacaca gatacgtctg 4020 

ggagtatcga atccacacag gcctttaatg ccaagagtta ggatggaatc aggaaggaag 4080 

ccagtgcaga agaaaactgg gccttgtcac aacctctgta agtcaagtcc caagggacct 4140 

gatatcttct gaactcagca gcgtgaggca caccatggtg catagatgca ggcgaaacat 4200 

gtgcataaaa tcataaaata atttaaaaga aaactgggtc ctaggacaac ttaaatattt 4260 

gtcattgtga gagacgaaga tgacagaata aagaaggcac agaaagaaca gccaatgaac 4320 

agagaaatta agagggtgat gttccagaga ttccgtgaaa caacaaagct gccagtggga 43 80 

tagcccagoa gggaagatgc tatcacccag cctgactacc tgaaccccag aacacacatg 4440 

gtggaaagag agaaacaatt cccataggtt gtcctctgac ctccacaagc atgacatgta 4500 

cacacacaca gtggatggat agatagatag atagatagat agatagatag atagatagat 4560 

agatagtagg atagatagat agatagatag atagatagat agataggtag gtaggtagat 4620 

agatagatag atagatagat agatagatag atagatagat agatagatag atagatagta 4680 

ggatagatag atagatagat agatagatag atagatagat agatagatag atagatagat 4740 

agatagatag aattgtaaat gtccaacaaa aggaaacaaa ccacctatgt gagttagtga 4800 

gtgggtgggg ttttttgttt ttgtttttgt ttttgttttt gttttggggt ttttttggag 4860 

tggaaacttc ctgagactgc catgcttctg caattctttg catgatttta gttctgcttg 4920 

tggtactgtt aacaaggctg gaggcaagtt gttaggtgga ctgcagtcta caaggagaac 4980 

ttactctttt ccatctaaac agtaagtaga aatggcaaga ttatgaaaag gtaaattgtt 5040 

gggaatcaaa tattagccct gcaaacccaa agatcagctg aagggatgtg atgaacatat 5100 

ctttatgaga actttataat ctaggggcgc ctgtggcccc attgaaattc ttaaatgtag 5160 

agagcatgac acagtccttg ttttaaaaga cccaccaaga aggataaagg atgatggcca 5220 

.^^^-.jMfl^Mflt. ^feoaflaaat qcrcrgaaagcc tcacacagcc agaacaccag 5280 

ggctctaagg tactgcacca tccagaattt gtagcttaat tccctacact gaactgaaag 5340 

aaagaagagt tttatggtcc aaccagagtg acctgtccct aggtaacagt ctgagctgat 5400 

tttccaaata ttcagattca gatgatagcc aacactgctg tccctccagc cttctacatc 5460 

ctttatcctg tttgtcacac agttgcttcc aaaacttctt ggaataaagc ataataaaag 5520 

tggacaatat aatggagctg ctctggagcc. tatgtgattc aacttcagat gcacatacct 5580 

agtgaacatt cacgaaccct gatatcacag caagttagaa tataaaagaa cacaaacatg 5640 

gccgaatcgg tgaaggacac aggacaatgg tcccctagga agacgtcgct ggcttgagca 5700 

atgggttacc catgctaagt aactgtctca cttcaacctg ggtccggttt gtaacgcata 5760 

gtcaagaaga ctgtgaactg acataaagaa gcatgagaaa ttctcctaac acatcagaca 5820 

tcttcaccgt gcttcaaaat tccactgaag ggctcacagg aaacaacata tccccttctt 5880 

ttctttcaga tgcttatgat actctctaca aaatacacca gtggggcaag taagatgaag 5940 

acagagcatt gaccagttga gtcagtgata gcatgtcctg aagatcctga caggttgtag 6000 

tggactggag agggtggagt gaaatgtgac tgtggtgaac tcaggagtga gcatgagtaa 6060 

" ggtggaaact agaatatagc taactccctg agaacctctc tctatcaggg actgggagcg 6120 

tatatctgag gggctatgag ggcaagaaag catttcttta agatgcaaga aagatgtaac 6180 

ccagtggtaa gcttaaagga atgcacatag agagcaaagc atgatgggat gagagtggta 6240 

cattcaactt cagttcatag ttcgataaaa agaacttagt gattgcctgt ccagagatca 6300 

ctccgagcct tacttctcag tctgtacctg agaaaagcac tgttgtcgcc gtaagcagag 6360 

catattcagt atccgataca tgaagttcac tcattcttct gtagaagtaa gatagtgagg 6420 

caataaattc tttagtaata tctgaaaaga ttaatataag tgaattttca tcggaatgga 6480 

gcacagaagc aaacacattg tgattgtctt ttcttctctc aaacagtgta agaaaattga 6540 

aaagagagtg gggttcaaga tggctaccga tgtgatggcc ggcacagaca atcccctttg 6600 

ttgtgccctt gaagcccagg cctgaacctc aagagtgggc gcattaggtg tgctactgaa 6660 

aaggaccaga acaaagagag gttgtcatca cgtggaccaa ggaggaggct gtggaactga 6720 

gagcctttag cctacagaag ccactctccg tgtatcjtcta ttatctacag taaggcttac 6780 

ccttctaccc ataaaagtgg ttgatactgg gatgttggct gtagattaca tcatcatcct 6840 

aagaataatt cagaggcccc aaatggcttc gcaattaaat agggaaagaa ttgccatgat 6900 

gaagccaaag taagaaaatg gttacaaata gaattctcac tgatgacctg ccaacactaa 6960 

caactgtatt ggctgtgtct ataaatgctt tcaccaccac agagacgcct gtgagctgtg 7020 
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tggccttctc tttgtagtgc aatgaaaagg ggagaaaagg 
atgcttccta ctgcacagtg ttaccagtta gaggagccga 
agttttccgg agaatgaaca ctttcatcag cgctaggatt 
ctgagggctt tgctggtctc atagtaccta gcaaaagaga 
ctttcttaat cattatataa atagtaaatt cagtatcttt 
tatgtggggg aaaaaaaaac tcctaaaatg atatcattct 
aaaacagata ttagttcata ttttgtaaaa aaaaaaaaaa 
aaaatgagtt ttgcataatg aaatagaaac taagaatggc 
tccacgaacc actcagtgca ccagaggctg tggagacttt 
aatcatggac caggtcatca agggcctccc tttagagcct 
gaacaatcaa taaataagga tattttaaaa ggctgtgcac 
aaggcagcta gcattgttgg cttttggttt tgattttgtc 
ctcaaccaac atgcgacttt acagattgga ctcttgcatt 
ctttcctgct ctattcaaat ggtttattga ttttttaaaa 
tgaagtgagg tcctgcatag ttggttcaac actaacctaa 
gagacgttcc acaaacagta gcccgtggtt atgagtttca 
ccagagctgc tttgtgtctt agaatcacca acgcctttat 
cctcctcatt ggaagagcct gtctaaggag cacagaagcc 
cacagctgag aagctgttgg cagctgctag gggaaggaca 
tgagccctct aagttggcca tgcttcagtg gatgacccta 
agcagcacta tttagagtca atgggttatt ttgaaaaaga 
agttaggaga. gaggtgtgtg ggggattcta aaggagctgg 
taagatttaa aaaaaaaaac acttaggaca tgtatacagt 
caatttttaa aagaattaca cggtagagag aaaatgctag 
aaacagctac tatgagacat ttgttcctac actaactggt 
atgaaaggct gatgatgtta ggagttagga gttctggctt 
tctgtcccca gcaccataga aggcaggcat agaggctcta 
gagcagacac agcagggctt taagatcctc cttagctaca 
ggactacatg agagtctatt tccaaaaaac agtgtttaaa 



agaaagccca 
acccaaggtg 
tccttgagaa 
agtgtcccag 
tgttgatagt 
gcataactat 
actggaacac 
aagaccctcc 
acacacaccc 
ctactcaagc 
tgtttgcaga 
acagctagtg 
ttcccatgtc 
acaatgatgg 
ttttttttat 
ccccagtaaa 
gcggccgtcc 
ccataaaagt 
aagccccacc 
tttagggaac 
cgtgtatatg 
aaggacatga 
gtgaggtgag 
ataagaaata 
aactatttat 
gaaaagataa 
gaccctggct 
tggcaggtaa 
tacatagcct 

^ l g U g t ^ t ^^ r ^ ^ ^ ^ _ _ _ _ 

tctgtctgtc tgtgtgtact tgaagttgaa taaaattctg tttttctttt 
ggttgagtct ttcccaccat aaagctgggc tacatgaagg aacatcactt 
cgccttctgt aatgcagcct gatcctcagt ggttaaattt tcaaatcctt 
aaataaataa ataaataaag cataaagaaa cagataatca aaagagaacg 
aaatgtttaa aaatatttac ttattaaccc taagtgcttc tggggaataa 
taaattgaaa tctgaataaa attctttttt tctattttgg ggcactttta 
gttacacttt cttacagcaa ggacaaatca acatagctgc attagccttg 
ttttgtaaag agtttctgta cctctaagca tataataaag ctgagttata 
ggtcaaactg tgacttgttt agtattatag aggaaacctt ctttgtttct 
aagaaagccc acatcattat aataattttt gtaaatattt agattgtcac 
aattgacaaa gacaaataaa catttttttt ttttttagtt agaagcaaca 
acccagggct tcacacacat tagacaggtg ctctgcctct gagcgataga 
caactagaaa tttctgtttg atatactatc cctttaaaaa aaagaaaatt 
gagatagctc agtgggtaag agcactgact actcttccta aaaagccctg 
ccagcaacca cacggtggct cgcaaccatc cgtaacgaga tctaacgccc 
cacctgaaga cagctacagt gtacttatga aatctttggg ccagaacgag 
cagagtgagc agaggtccta aaaattcaat tctcaacaac cacatgaagg 
tctgtacagc tacaatgtac tcagataaat aaataaataa ataaataaat 
ataaataatc ttttttaaaa attgggggct agaaagacag atcagcagtt 
cttgctcttg cagagactct ggttccaacg cccacatggt aattcacagc 
aaactctagt tctaggggac caaacactct cttctggcct ccacaatatg 
gatatgaata tataattgca atcaaaacat gcataacaca taaacataaa 
tccttaaaaa ataatgaaac agaattttac ctgggagtcc cttggtaaac 
cttgtatgtg caggactgat acctctgaga gtctcagaaa acttagttcg 
cctcctgcag ctggtccata aaaaaaataa ataaataaaa atgattcata 
ctgataatta ctaaggtgtt tcttttttac tgttcttgct atcatttcct 
attacaaatt atttcttgcc agcaatggat acatacaagt ttgcttgttt 
aatcatggat ttttggtgag cagtcactat ggtattcaga agccgatgtt 
tagagtcatg ttgtcctgaa cctataagga aacagagggc gcctgtaatg 
acacactgtt cataaagaca caagagactg tatggaagaa gcgtcttgcc 



aagcgtgtcc 

agggtagcct 

atgcacctct 

aaaacacaaa 

tcagtatcca 

catttttgtt 

gattgtacag 

tttcttaaac 

taagcaaaca 

tctggtctag 

tgtaacgagt 

ctattgcctc 

cacacactaa 

aatacttgct 

tagggcttca 

aagcaagcat 

gtttgatcac 

ccacccacag 

atccagtctc 

cccccacccc 

aggagcaggg 

tgggaaggaa . 

tctcacaaga 

atgaatgtga 

aaaataaaat 

actgagcaca 

ggctgatggt 

tactgagttc 

agtgtgtgcg 

tgtctgtctg 

:^KnMs milt mill iiiumji" aiaa-M 

tacttccaga 
cagtttttga 
aaaaataaat 
ttcataaatt 
caatagtata 
tcagcactag 
aagaatgtaa 
tataagtttt 
tgtttcatct 
tcatctgatt 
ttgctagaaa 
tctagtacca 
tggggctgga 
agttcaagtc 
tcttctggtg 
cggggccgac 
ctcacaacca 
aaataaataa 
aagagcctgt 
catcggtagg 
ttgtgtacat 
aataaataaa 
ttcattagcc 
gggttggaac 
gtatgtcttt 
aatatcattc 
ctcccaaggg 
cctcttgagt 
tgtgcttgtt 
cctttcccag 



7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
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8340 
8400 
8460 
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8640 
8700 
8760 
8820 
8880^ 
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9180 
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9840 
9900 
9960 
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atctggtgct ggatgacacg tgtttggtgt ctgctccttc atcttccact tggatggcag 10680 

ggtacagggc aggcccgtgc ttgaagttct tgcgaagtct ctttgactta cactggatct 10740 

ctgtgagcaa acctgtgaca ttacagaatt cacattaagt acttttggct cggaaaacaa 10800 

cactgtaggg tttgattaaa at'ggtcttca gtgagtaaat agtcctagat atctctcgtg 10860 

catgtctgca tctattaaat ttcctattct gcttttccct tcatagacac cttgtctgct 10920 

taaaaatagt tataacaaag attaagaatt aatttttcat ttccaaatta tgtttagatt 10980 

tttacataag agatttacac attagttgca tcttggaagt gaaaaagtca tttgatagaa 11040 

tagtagaatt acattatcta tgactattat cactggtttt attaataagg ttaaaaaaga 11100 

ctatcttagg atttttaaaa ataagctaag tcaatttttc ctatgtcaga caaagcattt 11160 

aattcttttt caaaggcaaa taatatttat tttaggattg tttttgtttg tttttaaata 11220 

ttttataatt acttgcatga gtctgtatgg ggaaatgcac aggtgagtgc aggtgcccac 11280 

agagcacaga ggggtcatag ccccttgaag tgagactata gctggttgtg agctgtagtg 11340 

tatgtactgg actggactgg actactctaa gtacttttag cagctactaa aagtatccaa 11400 

ctattaatat acctgtcgtt ttcaaagtat ttttaaaaac acacctataa aataaaaaac 11460 

taatgaggag ctgaagtata aacaaaaacc ttaaaacccc tctatattat tttattattt 11520 

gtcagagaac agaatacaca aaaaaataaa gccattttaa gcatattgag tattgaacat 11580 

ggtagtaaac acctataacc ccagctactt aagacactgt tacaagatga tctcacgttg 11640 

aataccagcc. atggcaactc agtgtttttt agtaggaaag gagtctatgg gtgtagctta 11700 

gtggtagatt gtctaggatg tgagaaacct gaactcaacc cccaggacca tgagaaggaa 11760 

aaccacaaga agagaacgaa gaggaggaaa. aggaggaaga agagggggaa gaggaggaag 11820 

aggaaaagaa aggaggaaag ggaaaggaga atgagggaag gaagaggagg aggggccgtc 11880 

agagcactaa cacatggcag tattaggaca actgaaaggt aaatgaagtg tgcagatcca 11940 

agcaatggtg gcagggtgcg gaccagctta tgctcgccaa tcaccgtgac tgctcaccaa 12000 

ctctctcgag cttcaccttg ccagaatttc tttaagtttc tttaagtgta ggatgactca 12060 

ttttatgcat agagttcctc agaggactcc aacttccctt tcacatgcat tgcctgccac 12120 

aggttaaaag acaacttttg agccagaaat tggagggcag aggcaggggg atctctgagt 12180 

ttgatgccag cctggcgtac atagcaagfet ccaggtcagc tacatagtga gatcctgaat 12240 

acaaattaaa ttaatgaata aaataaaaga caactttttc aaaggtttct ttggactgag 12300 

ttgtatctcc cactgctcat atgctaatgt gttgatattc agagatgagt cctgacactc 12360 

ttcaattgtc ctaagactgt agtgtgttag gactctgaga accccttctc agaaagggaa 12480 

gacttccttt tctcttcttt ctcctcctct ttctttcttt ttgggaacta ctaactttaa 12540 

ctttgcactt ttcaaggcag tatatatttt taggttaagt aacagaaaca tacaatacct 12600 

tttttaaatc actatggagg tttcttttat tttatgtgtg tgttttgctt gcatgtgtgt 12660 

atgcacacaa cactcatacc tgcagaggtc agaagagggc ctgggatgct caggaactga 12720 

gttataaatg attgtgaggg gccacgtgga agctggggac cgggcccagg tcctctccta 12780 

atgctcttaa ctgcctaact actcttcagg tctcagagaa catctaatta agatgaaact 12840 

atgactgatt tcaaaattag agcaatatag aacagggcaa catttatggg aatcactact 12900 

taaattgata atgggccatc cacgtggcgc aattactgta catatgtgtg tggtgtgcat 12960 

atacaaacga atacggccat gttcaaaata actcttgaag aatgctttat gtcctcaaga 13020 

aggttggaaa tttttaattt tttattaatt tttaaattca tttttattat atagaggaga 13080 

gtgtgaatac agtccttcac taacacaaat gatgcacctg agtttcccac atttgtggaa 13140 

atcacagggg tcagtcagca catccctagt gcaatgggcg attcttgccc tggaaaaacc 13200 

accaccacta tcatgtcctc cttgtcagga aggtaaggct gggaaaagtt aaatcccctc 13260 

agaaaagcaa gccacgaaca cactggcact tacattctgc caacatcccc accgccttac 13320 

acttcttcag tctgcactct tggcattttc tgcgcatgta catgtccatt tcacagtgac 13380 

caccgttctt gcaagaatac actgcattct tggtaatgct acgccggaaa aagcctaaca 13440 

acaaagatgg tcaaagtttg acatggatat gtttagttac ttctcatttt tattaacatt 13500 

ttacctttat agaaagtaat caagctaact agagtatttc aattaaatag ggtttttttt 13560 

attcccgaca tttttttaaa tatacattta tctggaaagt tagctaattc tataattctt 13620 

tatagttagt atagttcagt tgacatcctg aggctgggtc tttctttata acgtctggag 13680 

attaaatgga gtatcattat cacattaaga caactctcta ccctactata ggtaaattac 13740 

ctataagaat ttttttttta actgcattat acaaaatatc aagatcctac aatgcatctc 13800 

tcgagtttcc tttgagagaa tacatggtca cctaaaataa agaaaaaatt cagaaaagcc 13860 

tctcttttct aaaccctccc agtctgagtt gcagcccaca aaggtggtcc ctccctccct 13920 

catccgcaga tgctctctct ccctttccca gcacttgcca gatagttccc atcccttttc 13980 

ctccttcgaa ccttattaca cagtaatacc tgccatttcc tgctctccct cctatgagtt 14040 

gtcccaaagc cccaccactt actctcttgc agtggacact tgaaagatgg agtttcgcca 14100 
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tttcagttca acgaaggaag aaaaagctca gctcaggttt tcaaaactgt gctgggttct 14160 

tcagctcagt tttaacagta acccgtcacc gtcaagagca ggccacccct gcctaactac 14220 

tcgccacagg actctgtcac ctcatatcct ttggctattc tttgccagtg ttgaggttta 14280 

agatttacag gtaaaatttt accctaaaca aacacgttta tttgtgttac attgactaaa 14340 

ctgttttctg ttgtttgggt tgttttgttt tgtttcagag ggtgctagaa tcaaatccaa 14400 

aagtaggtgc taagtgctct accactggcc tgcattccca tcccgatgtt tatccatttc 14460 

tggagtggta gactgtttca gaactgttat ttatcagaga ggcttccgct ctttacaaat 14520 

atcattgcaa acactggccc ttgcttcccg actttcagct tgttctcttt ctatacgaaa 14580 

cactttttaa aatttattga cttagaatca tagaagtcat ttatttcaaa ttttctccaa 14640 

gtgtataaat tttatcaacc acataaaaag gtgtcatggt ttacagatct ttacattgtc 14700 

atgttttgtt ccagcatcaa attcttctag aaatagggaa aacgtttgct tatatgtatc 14760 

gagtttaaag taaagtgtga aagtcttcat caaaaatgaa gagacagacc tggtagtgaa 14820 

tgctcttaat gccagcagta aggatacaaa ggtaggtgga tctctgagag ttcaaggaca 14880 

gcctggfccta aacagtgagt tccaggacag ccagggatgc atagagagac cttttctcaa 14940 

aaaaagaaga aagatgagga ggagaagatg aggacgaagg ggaagacaag ggaggagaaa 15000 

aaaaaataaa gaaagaaatg gtccaggcat ggtggcccac acctttaatc ccagcaccta 15060 

gaaagcagag acaggcaggt atctgtgagt tcaaagccaa cctgatctcc atagtgaggt 15120 

ctaggccagc cagggctata caatgatact ctacttcaaa agagagagag ggggggaggg 15180 

agggagagag agagagagag agagagagag agagagagag agagagagag agagagagag 15240 

agagagagag agagagggag agagagagag agagagagag agagagagag agagagagag 15300 

agagagagag agaaagaaag aaagaaagaa agaaagaaag aaagaaagaa agaaagaaag 15360 

aaagaaagaa agaaagaaag aggaagggaa agggaaaggg aaagggaaag. ggaaaggaaa 15420 

ggaaaggaaa ggaaaggaaa ggaaaggaaa ggaaagaaac taaccttaaa cttaacattt 15480 

tattctccta tctttgatca cacagattga tccctttaga catacttttt cctggttttg 15540 

tttctcatgt tttataacga ttgcacacgc acgtgcacac acacatacac acgtgtgcgc 15600 

gtgcgcagca ctgtctgttt ggttgcgcct tacccttacc tttgcacccc tcacaagtaa 15660 

gtgcattgta gtgatatcct gatgccttat caccgcagac catgcagagc atgtcatctt 15720 

gacccttcca gcgtaaagaa caagttgctc tgcacctttg tgccccaggg aattcatctt 15780 

catcatcgtg aacaaccaca taagtggact gaccagattc tcgagcgtcg agtccacacc 15840 

aactgcctcc actgtactgt ggatccaagc catgtgtgtt gaaatgactt tgtaaagatg 15900 
gggactgtaa cgctggagga aactgagcgcp^^ 

aatctgtatc acacagctga taatggaatt gttctggcaa aatatctggt tttattaaca 16020 

taaaaaagaa caaatcataa aaggtcaaaa aagttcataa ttgttttaaa cctcaaaatg 16080 

ttttatactc aacagaaagt tagttcaaca catctattcc atctcaggac ttgccgtcca 16140 

gcacaatagg aacacagtaa atggaactaa caatacctaa taaatgctat gaccccaaag 16200 
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Fig. 14: 

cDNA splice variant of L66 from Mus musculus (SEQ ID NO. 22) 



atgttaataa 
ttccaagaac 
ccatcttfcac 
tggtgtggac 
gaagatgaat 
caagatgaca 
cttacttgtg 
tcttgcaaga 
tgcagactga 
cagtgtaagt 
atccaagtgg 
aaaggggttc 
gtgactgctc 
ggttccaacc 
gggctaatga 
gctgcattac 
ggtgggaaag 
ctagaggtgc 
gaaggctacc 
gaatatgctc 
aagcagcata 
atgtaccatc 
agaactctga 
ttggtgatgt 



aaccagatat 
caccctattg 
aaagtcattt 
tcgacgctcg 
tccctggggc 
tgctctgcat 
aggggtgcaa 
acggtggtca 
agaagtgtaa 
caaagagact 
aagatgaagg 
aggacaacat 
accaaaaatc 
ccgaactaag 
agtttaccaa 
agaaggcgtc 
actcaacctc 
ataatcctag 
cttcggctcc 
tgcttacggc 
tagaaaacct 
cagaagaccc 
gtcacagcca 
tattctctga 



tttgccagaa 
tcaatattct 
caacacacat 
agaatctggt 
acaaaggtgc 
ggtctgcggt 
aggctttttc 
ctgtgaaatg 
ggcggtgggg 
tcgcaagaac 
agcagacacc 
gactctaact 
catgattccc 
ttttctgaga 
gggactccca 
aaaaactgaa 
tggaagtact 
cgctgatgaa 
tctaactgga 
gacaacagtg 
acaagaacca 
acagcatttc 
ctctgaaatc 
gaaatgggat 



caattccatt 
accgctcagt 
ggcttggatc 
cagtccactt 
agagcaactt 
gataaggcat 
cggcgtagca 
gacatgtaca 
atgttggcag 
ttcaagcacg 
aaacacgtgt 
caagaggaac 
ttgggagaaa 
ctctcagagg 
ggatttgaaa 
gtgatgttcc 
atgagaccag 
agtgttcatt 
agaatgagtg 
cttttctcag 
gtcctgcaac 
gcccacctca 
cttcgcatgt 
ctgcactcat 



atcagctgtg 
ttcctccagc 
cacagtacag 
atgtggttgt 
gttctttacg 
caggatatca 
ttaccaagaa 
tgcgcagaaa 
aatgtttgct 
ggcctgccct 
catccagcac 
atcggcttct 
caagcaaact 
tatcagtcct 
atttaaccac 
ttcatgtagc 
caaagccctc 
ctccggaaaa 
aacttcatgt 
atcgtccatg 
ttttgtttaa 
tagggaggct 
ggaaaacaaa 
tttcctga 



tgatacagat 
gttacagtcc 
tggaggcagt 
tcacgatgat 
ctggaagggt 
ctacaatgca 
tgcagtgtat 
atgccaagag 
cacagagatc 
gtaccctgcc 
cagatctggg 
gaataccata 
tctgcaggag 
gcacatacaa 
tgaggatcag 
ccagctttat 
agctgggaca 
ctttctcaag 
atcggatact 
ccttaaaaat 
gttttcaaaa 
tactgaactg 
ggaccccagg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1428 



tcaggaaaat 
tgttttccac 
cctccctatg 
aaacaaaagt 
tggacgatct 
atgaagttca 
ttccggagaa 
gggctttgct 
tacatgaagg 
ggttaaattt 
. gactgatacc 
tttgcttgtt 
aagccgatgt 
gcfcggafcgac 
ggcaggcccg 
caaacattct 
tctgcgcatg 
cttggtaatg 
atatcctgat 
taaagaacaa 
aaccacataa 
gtactgtgga 
tggaggaaac 
cagctgataa 



gagtgcagat 
atgcgaagga 
aggtgggcga 
tgcaggactg 
gagaaaagca 
ctcattcttc 
tgaacacttt 
ggtctcatag 
aacatcactt 
tcaaatcctg 
tctgagagtc 
tctcccaagg 
tcctcttgag 
acgtgtttgg 
tgcttgaagt 
gccaacatcc 
tacatgtcca 
ctacgccgga 
gccttatcac 
gttgctctgc 
gtggactgac 
tccaagccat 
tgagcggtag 
tggaattgtt 



cccatttctc 
tttcagagtg 
aatgctgtgg 
gttcttgtag 
ctgttgtcgc 
cagttagagg 
catcagcgct 
tacttccaga 
cagtttttga 
ggagtccctt 
tcagaaaact 
gaatcatgga 
ttagagtcat 
tgtctgctcc 
tcttgcgaag 
ccaccgcctt 
tttcacagtg 
aaaagccttt 
cgcagaccat 
acctttgtgc 
cagattctcg 
gtgtgttgaa 
aatattgaca 
ctggcaaaat 



agagaataac 
gctgtgactc 
gtcttctgga 
gttttctata 
cgtaagcaga 
agccgaaggg 
aggattatgc 
ggttgagtct 
cgccttctgt 
ggtaaacttc 
tagttcgggg 
tttttggtga 
gttgtcctga 
ttcatcttcc 
tctctttgac 
acacttcttc 
accaccgttc 
gcacccctca 
gcagagcatg 
cccagrggaat 
agcgtcgagt 
atgactttgt 
atagggtggt 
atctggtttt 



atcaccaacc 
agagttctca 
tggtacattt 
tgctgcttat 
gcatattcag 
tagccttcct 
acctctagtg 
ttcccaccat 
aatgcagcct 
attagccctt 
ttggaaccct 
gcagtcacta 
acccctttcc 
acttggatgg 
ttacactgga 
agtctgcact 
ttgcaagaat 
caagtaagtg 
tcatcttgac 
t cat ct teat 
ccacaccaac 
aaagatgggg 
tcttggaaat 
attaacat 



tggggtcctt 

gttcagtaag 
ttgaaaactt 
ttttaaggca 
tatccgatac 
tgagaaagtt 
tcccagctga 
aaagctgggc 
gatcctcagt 
gtatgtgcag 
ectgeagaag 
tggtattcag 
cagatctggt 
cagggtacag 
tctctgtgag 
cttggcattt 
acactgeatt 
cattgtagtg 
ccttccagcg 
categtgaac 
tgcctccact 
actgtaaege 
ctgtatcaca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1428 
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Fig. 14 - continu d: 

Protein splice variant of L66 from Mus musculus (SEQ ID NO. 24) 

MLIKPDILPE QFHYQLCDTD FQEPPYCQYS TAQFPPALQS PSLQSHFNTH GLDPQYSGGS 60 

WCGLDARESG QSTYVWHDD EDEFPGAQRC RATCSLRWKG QDDMLCMVCG DKASGYHYEA 120 

LTCEGCKGFF RRSITKNAVY SCKNGGHCBM DMYMRRKCQE CRLKKCKAVG MLAECLLTEI 180 

QCKSKRLRXN FKHGPALYPA IQVEDEGADT KHVSSSTRSG KGVQDNMTLT Q TCKHR LT.N T I 240 

VTAHQKSMIP LGETSKLLQE GSNPELSFLR LSEVSVLHIQ GLMKFTKGLP GFENLTTEDQ 300 

AALQKASKTE VMFLHVAQLY GGKDSTSGST MRPAKPSAGT LEVHNPSADE SVHSPENFLK 360 

EGYPSAPLTG RMSELHVSDT EYALLTATTV LFSDRPCLKN KQHIENLQEP VLQLLFKPSK 420 

MYHPEDPQHF AHLIGRIiTEL RTBSHSHSEI LRMWKTKDPR LVMUFSEKWD LHSFS 475 
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Fig. 15 

Human: 

SEQ ID NOs: 1 = all exons human 

SEQ ID NOs: 4 = NC fragment F'g.5 

SEQ ID NOs: 6 = poly A1 fragment Fig-® 

SEQ ID NOs: 8 = poly A2 fragment Fig.7 

SEQ ID NOs: 1 0 = GF fragment Fig.8 

SEQ ID NOs: 12 = SF fragment Fig.9 

SEQ ID NO. 3 = protein L66 from SEQ ID NO. 1 Fig.2 



SEQ ID NOs: 2 = RC of all exons human 
SEQ ID NOs: 5 = RC NC fragment 
SEQ ID NOs: 7 = RC poly A1 fragment 
SEQ ID NOs: 9 = RC poly A2 fragment 
SEQ ID NOs: 1 1 = RC GF fragment 
SEQ ID NOs: 13 = RC SF fragment 

SEQ ID NOs: 14 = primer 18 sRNA fwd 
SEQ ID NOs: 15 = primer 18 sRNA rev 
SEQ ID NOs: 16 = probe 18 sRNA 

Mouse: 



— — ~»~-~~ggcuoAjC^ JljSiiL. 

SEQ ID NOs: 1 8 = RC of cDNA L66 from Mus musculus Fig. 4 

SEQ ID NOs: 19 = protein L66 from Mus musculus Fig. 4 

SEQ ID NOs: 20 = genomic DNA L66 from Mus musculus Fig. 12 

SEQ ID NOs: 21 = RC of genomic DNA L66 from Mus musculus Fig. 13 

SEQ ID NO: 22 = cDNA L66 from Mus musculus (splice variant 1 ) Fig. 14 
SEQ ID NO: 23 = RC of cDNA L66 from Mus musculus (splice variant 1) Fig. 14 
SEQ ID NO: 24 = protein L66 from Mus musculus (splice variant 1 ) Fig. 14 



Fig.1 
Fig.5 
Fig.6 
Fig.7 
Fig.8 
Fig.9 

No Fig. 
No. Fig. 
No. Fig. 
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Fig;T6 



QUERY 1 PGIKRSRPTySSSRNKGQEEFCWOTOK^PSPYHYNALTCSGC^ 

P .K: .R S:.RKG :E.CWCGD:AS YHYNALTCEGCK GF 
HIT 107 PVTKKPRMGASAGRIKG -DELCWCGDRASG- - YHYNALTCEGCK GF 

FQCS IXQl^VYS CRJSTGSHCE^MYMRRKCQE CRIiKKYKAVGMLAEC LLTEIQCKLK 

F: SI. :NAVY.C:NG. :C MDMYMRRKCQECRL : K K. :GMXiABC LLTEIQCK K 
FRRSITKNAVYKOOJGGNCT^ 

RLQKNFKEKNHFYSNXKVEEEGVDHSFLS STTRPGCTSMELTEEEHQLINNTVAAHQKYT 
RL:KN.K: H . .:. : .EG D . : : STT : . : E . . ELT . : : . . L : ; Is : : .K. . 
RLRKNVKQ- "HADQTVNEDSEGRDLRQVTSTTKS CREKTELTPDQQTIIiHFIMDSYNKQR 

IPIiEETOLYIiQEHTNPELSFLQLSETAVIiHra 

:P E TN .L:E. : .E :FL L:E.A. H: : L: .FTK LPGF; .L :EDQ.AL KGS. 
MPQEITNKILKEEFSAEENFLILTEMATNHVQVLVEFTK^ 

TEVIFLHGAQLY1JTMIIS I CLILPYVWMKIHFRI SFIjGVTEEFITXIiFYFYKRMSICIjDVT 
.E. :FL. -A: : :N. . : S L :. RI . G:::E:IT :F FYX.:.:L.:T 

VEAMFLRSAEIFNKKIiPSGHSDL IiEERIRNSGISDEYITPMFSFYKSIGELKMT 

NTEYALIAATIVFS -DRPOuKNKQYMENIiXEPVI^^ 

. .EYALL.A. : : .S DR. :K: : : :E.L.EP:L. :L.K. .K: : .PE: P.HFA L: : T 
QEEYALLTATVILSPDRQYIKDREAVEKI^ 

ELRTIiNYOTSEILSTWKTKDPKIATIjLSE 384 
ELRT.Nr : H : E : L . :W: . .D K. . .LL.E 
ELRTFNHHHAEMLMSWRVNDHKFTPIiIiCE 471 
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Fig. 17 B 
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Fig. 18 A 




Fig. 18 B 
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Fig. 19: 
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Fig. 19 continued: 
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Fig. 19 continu d: 
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Fig. 19 conti nu d: 
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Fig. 20: 
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Fig. 20 c ntinued: 




00 CD 00 
H H H 
I I I 
H H H 
>.\ ^> 
h 0> O 

r» in vo 
vo ro en 
vo vo vo 

H H H 

O O O 



GO ' 00 
H H 
I 

H 



a>cooacso^Hoocoor~^^vo^^\"^ 

IHp)HPJr)P)HHNMririHH»jgH 
iitiiiiiiiiiiikjcos: 



H H H H H 



H 00 

vo in 

C> CO 
VO VD 
Hrlrl 

o o o 



in cn 
in <n 
ro in 



3 



VDcor*ino^«coint«*ovocnNcs 
u>pinHU)io^n(OHO(n(OQ 

vo^^iinininco^cointnt^r'^rtH 



i i w H cs oo in 

H H ri l fO H H 
"\ \ l H O I I 

,,o S,w 2 8 ^ vo 

|vo J cn S — ro ro 

^ ri N Q O 0\ C) 

vo W oi w o* i-* 

slangs 



^ t* to 

1 CO 



m in lo 

CO H H 



H <n oj cq co 



cn o 
in vo 



H CO 

vo in 



vo eo > u» o ^ o in h o vo «i> n « pi t « o in 
iVorocnHvoin^cncoHOcncoronNCAVDO 



£ E 9 m 




en in cn 
<n rt, o fl\ o 



SUBSTITUTE SHEET (RULE 26) 



WO 02/22817 



PCT/EP01/10323 



37/37 



Fig. 20 continued: 
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SEQUENCE LISTING 

<110> LION Bioscience AG 

<120> NOVEL MAMMALIAN NUCLEAR RECEPTOR L66 AND METHODS OF USE 
<130> L-0020-01-WO-01 
<160> 24 

<170> Patentln version 3.0 

<210> 1 
<211> 1152 
<212> DNA 
<213> • Homo- sapiens 

<220> 

<221> n 

<222> (!)..( 1152) 

<223> n may be a. c. g or t 



<4 ^ u " 1 . .u ,4. nasataannn araaaaaaaa 60 

120 



_ -aaflgg j t cjaOCcajcc^ acaggaagaa 
ttctgtgtag tttgtggtga taaagcatca ccatcaccat atcattataa tgcacttacc 
tgtgaaggtt gcaaagaaat acctatggta aaaaatttta aaactttttt attgggtttt 
tttcaatgta gcatcnnnca aaatgcagta tatagttgca ggaatggtag tcactgtgaa 
atggacatgt acatgcgtag aaaatgtcaa gagtgcagac tgaaaaagta taaggcagta 
ggaatgttgg cagaatgttt gctcacagaa atccaatgta aattaaagag acttcaaaag 
aactttaagg agaagaatca tttttactct aacatcaaag tggaagagga aggagtagac 
cacagttttc tatcatccac cactagacct ggaaaagaaa gcatggaact aactgaagag 
gaacatcagc tcattaataa cattgtggct gctcatcaaa aatataccat tcctttagaa 
gaaacaaatt tgtatctgca ggaacataca aatcctgaac tgagcttttt gcaactctca 
gagacagcag tcctacacat acgtgggcta atgaatttta ccaaggggct cccaggattt 
gaaaatttgg ccaatgagga tcaaactgca ctacagaagg gatcaaaaac tgaagtgata 
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240 

300 
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tttctccatg gggcccaact ttacaataca 


atgataattt ccatatgttt gattctaccc 


780 


tatgtttgga tgaaaataca ttttcgtatc 


agttttttgg gtgttactga agaatttatt 


840 


acannnctgt tttacttcta caaaagaatg 


agcaaacttg atgtaactaa tactgaatat 


900 


gctctgcttg cagcaacaat tgttttttca 


gatcgtccat gccttaaaaa taagcaatat 


960 


atggaaaatt tannngaacc agttttacaa 


atattgtata agtattcaaa aatgtatcat 


1020 


ccagaagacc cannncattt tgcccatctc 


atatggaagc atactgaact gagaactctg 


1080 


aattataacc attcagaaat acttagcact 


tggaaaacaa aggaccccaa attggctact 


1140 


ttactctctg- ag 




1152. 



<210> 2 

<211> 1152 

<212> JDNA 

<213> Homo sapiens 

<220> 

<222> (!)..( 1152) 



<223> n may be a. c, 


g or t 






<400> 2 








ctcagagagt aaagtagcca 


atttggggtc 


ctttgttttc caagtgctaa gtatttctga 


60 


atggttataa ttcagagttc 


tcagttcagt 


atgcttccat atgagatggg caaaatgnnn 


120 


tgggtcttct ggatgataca 


tttttgaata 


cttatacaat atttgtaaaa ctggttcnnn 


180 


faaattttcc atatattgct 


tatttttaag 


gcatggacga tctgaaaaaa caattgttgc 


240 


tgcaagcaga gcatattcag 


tattagttac 


atcaagtttg ctcattcttt tgtagaagta 


300 


aaacagnnnt gtaataaatt 


cttcagtaac 


acccaaaaaa ctgatacgaa aatgtatttt 


360 


catccaaaca tagggtagaa 


tcaaacatat 


ggaaattatc attgtattgt aaagttgggc 


420 


cccatggaga aatatcactt 


cagtttttga 


tcccttctgt agtgcagttt gatcctcatt 


480 


ggccaaattt tcaaatcctg 


ggagcccctt 


ggtaaaattc attagcccac gtatgtgtag 


540 
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gactgctgtc tctgagagtt gcaaaaagct cagttcagga tttgtatgtt cctgcagata 600 

caaatttgtt tcttctaaag gaatggtata tttttgatga gcagccacaa tgttattaat 660 

gagctgatgt tcctcttcag ttagttccat gctttctttt ccaggtctag tggtggatga 720 

tagaaaactg tggtctactc cttcctcttc cactttgatg ttagagtaaa aatgattctt 780 

ctccttaaag ttcttttgaa gtctctttaa tttacattgg atttctgtga gcaaacattc 840 

tgccaacatt cctactgcct tatacttttt cagtctgcac tcttgacatt ttctacgcat 900 

gtacatgtcc atttcacagt gactaccatt cctgcaacta tatactgcat tttgnnngat 960 

gctacattga aaaaaaccea ataaaaaagt tttaaaattt; tttaccatag gtatttcttt. 1020 

gcaaccttca caggtaagtg cattataatg atatggtgat ggtgatgctt tatcaccaca 1080 

aactacacag aattcttcct gtcccttatt tctcgaggaa gaataggttg gtctggacct 1140 
ttttattcca gg 



<210> 3 

<211> 384 

<212> PRT 

<213> Homo sapiens 

<220> • . 

<221> X 

<222> (D..C384) 

<223> X may be any amino acid 



^400> 3 

Pro Gly He Lys Arg Ser Arg Pro Thr fyr Ser Ser Sen Arg Asn Lys 
I 5 10 15 

Gly Gin Glu Glu Phe Cys Val Val Cys Gly Asp Lys Ala Ser Pro Ser 
20 25 30 

Pro Tyr His Tyr Asn Ala Leu Thr Cys Glu Gly Cys Lys Glu He Pro 
35 40 45 

Met Val Lys Asn Phe Lys Thr Phe Leu Leu Gly Phe Phe Gin Cys Ser 
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50 
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55 60 



He Xaa Gin Asn Ala Val Tyr Ser Cys Arg Asn Gly Ser His Cys Glu 
65 70 75 80 

Met Asp Met Tyr Met Arg Arg Lys Cys Gin Glu Cys Arg Leu Lys Lys 
85 90 95 

Tyr Lys Ala Val Gly Met Leu Ala Glu Cys Leu Leu Thr Glu He Gin 
100 105 HO 

Cys Lys Leu Lys Arg Leu Gin Lys Asn Phe Lys Glu Lys Asn His Phe 
115 120 125 

Tyr Ser Asn lie Lys Val Glu Glu Glu Gly Val Asp His Ser Phe Leu 

.... 130 •. •; -135- - -" • MO-. 5 ' ■ ~ - • 

Ser Ser Thr Thr Arg Pro Gly Lys Glu Ser Met. Glu Leu Thr Glu Glu 
145 150 155 • 160 

Glu His Gin Leu He Asn Asn lie Val Ala Ala His Gin Lys Tyr Thr 
165 170 175 

He Pro Leu Glu Glu Thr Asn Leu Tyr Leu Gin Glu His Thr Asn Pro 
180 185 190 

Glu Leu Ser Phe Leu Gin Leu Ser Glu Thr Ala Val Leu His He Arg 
195 200 205 

Gly Leu Met Asn Phe Thr Lys Gly Leu Pro Gly Phe Glu Asn Leu Ala 
210 215 220 

Asn Glu Asp Gin Thr Ala Leu Gin Lys Gly Ser Lys Thr Glu Val lie 
225 230 235 240 

Phe Leu His Gly Ala Gin Leu Tyr Asn Thr Met lie lie Ser lie Cys 
245 250 255 

• Leu He Leu Pro Tyr Val Trp Met Lys He His Phe Arg He Ser Phe 
260 265 . 270 

Leu Gly Val Thr Glu Glu Phe He Thr Xaa Leu Phe Tyr Phe Tyr Lys 
275 280 285 

Arg Met Ser Lys Leu Asp Val Thr Asn Thr Glu Tyr Ala Leu Leu Ala 
290 295 ' 300 
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Ala Thr lie Val Phe Ser Asp Arg Pro Cys Leu lys Asn Lys Gin Tyr 
305 310 315 320 

Met Glu Asn Leu Xaa Glu Pro Val Leu Gin He Leu Tyr Lys Tyr Ser 
325 330 335 

Lys Met Tyr His Pro Glu Asp Pro Xaa His Phe Ala His Leu He Trp 
340 345 350 

Lys His Thr Glu Leu Arg Thr Leu Asn Tyr Asn His Ser Glu He Leu 
355 360 365 

Ser Thr Trp Lys Thr Lys Asp Pro Lys Leu Ala Thr Leu Leu Ser Glu 
370 375 380 

-<210> 4 • '• ••' - ' ■■ - * - - : ■ - ' ' " ■ 

<211> 1077 

<212> DNA 

<213> Homo sapiens 

<400> 4 

cctggaataa aaaggtccag accaacctat tcttcctcga gaaataaggg acaggaagaa 60 
ttctgtgtag tttgtggtga taaagcatca ccatcaccat atcattataa tgcacttacc 120 



tgtgaaggtt gcaaagcatc aacaaaatgc agtatatagt tgcaggaatg gtagtcactg 180 

tgaaatggac atgtacatgc gtagaaaatg tcaagagtgc agactgaaaa agtataaggc 240 

agtaggaatg ttggcagaat gtttgcacac agaaatccaa tgtaaattaa agagacttca 300 

aaagaacttt aaggagaaga atcattttta ctctaacatc aaagtggaag aggaaggagt 360 

agaccacagt tttctatcat ccaccactag acctggaaaa gtgattcagg aaagcatgga 420 

actaactgaa gaggaacatc agctcattaa taacattgtg gctgctcatc aaaaatatac 480 

cattccttta gaagaaacaa atttctgcag gaacatacaa atcctgaact gagctttttg 540 

caactctcag agacagcagt cctacacata cgtgggctaa tgaattttac caaggggctc 600 

ccaggatttg aaaatttggc caatgaggat caaactgcac tacagaaggg atcaaaaact 660 

gaagtgatat ttctccatgg ggcccaactt tacagtcaga aacaatcagc ctctgaaagt 720 

tctgtgagaa tattaaatca ttcagattat acaccaaatt gtcacaatag gagtggtgat 780 
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L-0020-01-WO-01.ST25 
ggaaaaattt tacaatgaag aatgtccttc 


tactactcta 


840 


attggtttca agaagctcat 


ctggaaaatg gtgataataa tagaacttac 


ctcatacagt 


900 


attgtgacta ctacataaaa 


taatacatag agatcgtcca tgccttaaaa 


ataagcaata 


960 


tataoaaaat ttacaaaaac 


cagttttaca aatattgtat aagtattcaa 


aaatgtatca 


1020 


tccagaagac ccatagcatt 


ttgcccatct catatggaag catactgaac 


tgagaac 


1077 


<210> 5 
<211> 1077 
<212> DMA 
<213> Homo sapiens 




v 




<400> 5 

qttctcaatt caotatqctt 


ccatatgaga tgggcaaaat gctatgggtc 


ttctqqatqa 


60 


tacattttta aatacttata 


caatatttgt aaaactggtt cttgtaaatt 


ttccatatat 


120 


tgcttatttt taaggcatgg 


acgatctcta tgtattattt tatgtagtag 


tcacaatact 


. 180 


gtatgaggta agttctatta 


ttatcaccat tttccagatg agcttcttga 


aaccaattag 


240 


aqtaqtaqaa qqacattctt 


cattgtaaaa tttttccata gaacaaataa 


gacttctatc 


"loo 


accactccta ttgtgacaat 


ttggtgtata atctgaatga tttaatattc 


tcacagaact 


360 


ttcagaggct gattgtttct 


gactgtaaag ttgggcccca tggagaaata 


tcacttcagt 


420 


ttttgatccc ttctgtagtg 


cagtttgatc ctcattggcc aaattttcaa 


atcctgggag 


480 


ccccttggta aaattcatta 


gcccacgtat gtgtaggact gctgtctctg 


agagttgcaa 


540 


^aagctcagt tcaggatttg 


tatgttcctg cagaaatttg tttcttctaa 


aggaatggta 


600 


xatttttgat gagcagccac 


aatgttatta atgagctgat giuccucttc 


agitagttcc 


bou 


atgctttcct gaatcacttt 


tccaggtcta gtggtggatg atagaaaact 


gtggtctact 


720 


ccttcctctt ccactttgat 


gttagagtaa aaatgattct tctccttaaa 


gttcttttga 


780 


agtctcttta atttacattg 


gatttctgtg tgcaaacatt ctgccaacat 


tcctactgcc 


840 


ttatactttt tcagtctgca 


ctcttgaeat tttctacgca tgtacatgtc 


catttcacag 


900 
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tgactaccat tcctgcaact atatactgca ttttgttgat gctttgcaac cttcacaggt 960 

aagtgcatta taatgatatg gtgatggtga tgctttatca ccacaaacta cacagaattc 1020 

ttcctgtccc ttatttctcg aggaagaata ggttggtctg gaccttttta ttccagg 1077 



<210> 6 

<211> 331 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> n 

<222> (1)..(331) 

<223> "n" may be A. C. G~or X 



<400> 6 

cccaacttta cagtcagaaa caatcagcct ctgaaagttc tgtgagaata ttaaatcatt 60 

cagattatac accaaattgt cacaatagga gtggtgatag aagtcttatt tgttctatgg 120 

aaaaatttta caatgaagaa tgtccttcta ctactctaat tggtaatatg actcaatatg 180 



aaatattata ttggatgcaa aaatttgtat ataatgttta actttcttat actgctttga 240 
gatacaatga taatttccat atgtttgatt ctaccctatg tttggatgaa aatacatttt 300 
cgtatcngcc caaaaaaaaa aaaaaaaaaa a . 331 



<210> 7 

<211> 331 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> n 

<222> (1)..(331) 

<223> "H" may be either A. C. G or T 



<400> 7 

tttttttttt tttttttttt gggcngatac gaaaatgtat tttcatccaa acatagggta 60 
gaatcaaaca tatggaaatt atcattgtat ctcaaagcag tataagaaag ttaaacatta 120 
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tatacaaatt tttgcatcca atataatatt tcatattgag tcatattacc aattagagta 180 

gtagaaggac attcttcatt gtaaaatttt tccatagaac aaataagact tctatcacca 240 

ctcctattgt gacaatttgg tgtataatct gaatgattta atattctcac agaactttca 300 

gaggctgatt gtttctgact gtaaagttgg g 331 

<210> 8 
<211> 311 
<212> DNA 
<213> Homo sapiens 

<400> 8 • * - - ' . " • 

cccaacttta cagtcagaaa caatcagcct ctgaaagttc tgtgagaata ttaaatcatt 60 

cagattatac accaaattgt cacaatagga gtggtgatag aagtcttatt tgttctatgg 120 

aaaaatttta caatgaagaa tgtccttcta ctactctaat tggtaatatg actcaatatg 180 

aaatattata ttggatgcaa aaatttgtat ataatgttta actttcttat actgctttga 240 

gatacaatga taatttccat atgtttgatt ctaccctatg tttggatgag gaaaaaaaaa 300 



aaaaaaaaaa a 311 

<210> 9 

<211> 311 

<212> DNA 

<213> Homo sapiens 

<400> 9 

tttttttttt tttttttttt cctcatccaa acatagggta gaatcaaaca tatggaaatt 60 

atcattgtat ctcaaagcag tataagaaag ttaaacatta tatacaaatt tttgcatcca 120 

atataatatt tcatattgag tcatattacc aattagagta gtagaaggac attcttcatt 180 

gtaaaatttt tccatagaac aaataagact tctatcacca ctcctattgt gacaatttgg 240 

tgtataatct gaatgattta atattctcac agaactttca gaggctgatt gtttctgact 300 

gtaaagttgg g 311 
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<210> 10 
<211> 425 
<212> DNA 
<213> Homo sapiens 

<400> 10 

ggccaatgag gatcaaactg cactacagaa gggatcaaaa actgaagtga tatttctcca 60 

tggggcccaa ctttacagtc agaaacaatc agcctctgaa agttctgtga gaatattaaa 120 

tcattcagat tatacaccaa attgtcacaa taggagtggt gatagaagtc ttatttgttc 180 

tatggaaaaa ttttacaatg aagaatgtcc ttctactact ctaattgatc gtccatgcct 240 

taaaaataag- caatatatgg aaaatttaca • agaaccagtt ttacaaatat -tgtataagta- .- 300 ■ 

ttcaaaaatg tatcatccag aagacccata gcattttgcc catctcatat ggaagcatac 360 

tgaactgaga actctgaatt ataaccattc agaaatactt agcacttgga aaacaaagga 420 
cccca 



<210> 11 



425 



<212> DNA 

<213> Homo sapiens 

<400> 11 

tggggtcctt tgttttccaa gtgctaagta tttctgaatg gttataattc agagttctca 60 

gttcagtatg cttccatatg agatgggcaa aatgctatgg gtcttctgga tgatacattt 120 

ttgaatactt atacaatatt tgtaaaactg gttcttgtaa ' attttccata tattgcttat 180 

ttttaaggca tggacgatca attagagtag tagaaggaca ttcttcattg taaaattttt 240 

ccatagaaca aataagactt ctatcaccac tcctattgtg acaatttggt gtataatctg 300 

aatgatttaa tattctcaca gaactttcag aggctgattg tttctgactg taaagttggg 360 

ccccatggag aaatatcact tcagtttttg atcccttctg tagtgcagtt tgatcctcat 420 
tggcc 



425 
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<210> 12 
<211> 299 
<212> DNA 
<213> Homo sapiens 

<400> 12 

ggccaatgag gatcaaactg cactacagaa gggatcaaaa actgaagtga tatttctcca 60 

tggggcccaa ctttacagtc agaaacaatc agcctctgaa aatcgtccat gccttaaaaa 120 

taagcaatat atggaaaatt tacaagaacc agttttacaa atattgtata agtattcaaa 180 

aatgtatcat ccagaagacc catagcattt tgcccatctc atatggaagc atactgaact 240 

gagaactctg aattataacc attcagaaat acttagcact tggaaaacaa aggacccca 299 



<210> 13 

<211> 299 

<212> DNA 

<213> Homo sapiens 

<400> 13 

tggggtcctt tgttttccaa gtgctaagta tttctgaatg gttataattc agagttctca 60 



gttcagtatg cttccatatg agatgggcaa aatgctatgg gtcttctgga tgatacattt 120 

ttgaatactt atacaatatt tgtaaaactg. gttcttgtaa attttccata tattgcttat 180 

ttttaaggca tggacgattt tcagaggctg attgtttctg actgtaaagt tgggccccat 240 

ggagaaatat cacttcagtt tttgatccct tctgtagtgc agtttgatcc tcattggcc 299 



<210> 14 

<211> 24 

<212> DNA 

<213> Artificial 

<400> 14 

cgtgggctaa tgaattttac caag 24 



<210> 15 
<211> 22 
<212> DNA 
<213> Artificial 
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ggccccatgg agaaatatca ct 
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<210> 16 

<211> 31 

<212> DNA 

<213> Artificial 

<400> 16 

ccaatgagga tcaaactgca ctacagaagg g 31 
<210> 17 

<2ii> 1470; •- ..... . ... 

<212> DNA 

<213> Mus musculus 

<400> 17 

atgttaataa aaccagatat tttgccagaa caattccatt atcagctgtg tgatacagat 60 
ttccaagaac caccctattg tcaatattct accgctcagt ttcctccagc gttacagtcc 120 



ccsrtctttac 


^aaagteattfr 




«§g€t4g§at^€iGagtacag-±ggagg£sgt 


180_ 


tggtgtggac 


tcgacgctcg 


agaatctggt 


cagtccactt atgtggttgt tcacgatgat 


240 


gaagatgaat 


tccctggggc 


acaaaggtgc 


agagcaactt gttctttacg ctggaagggt 


300 


caagatgaca 


tgctctgcat 


ggtctgcggt 


gataaggcat caggatatca ctacaatgca 


360 


. cttacttgtg 


aggggtgcaa 


aggctttttc 


cggcgtagca ttaccaagaa tgcagtgtat 


420 


tcttgcaaga 


acggtggtca 


ctgtgaaatg 


gacatgtaca tgcgcagaaa atgccaagag 


480 


tgcagactga 


agaagtgtaa 


ggcggtgggg 


atgttggcag aatgtttgct cacagagatc 


540 


cagtgtaagt 


caaagagact 


tcgcaagaac 


ttcaagcacg ggcctgccct gtaccctgcc 


600 


atccaagtgg 


aagatgaagg 


agcagacacc 


aaacacgtgt catccagcac cagatctggg 


660 


aaaggggttc 


aggacaacat 


gactctaact 


caagaggaac atcggcttct gaataccata 


720 


gtgactgctc 


accaaaaatc 


catgattccc 


ttgggagaaa caagcaaact tctgcaggag 


780 


ggttccaacc 


ccgaactaag 


ttttctgaga 


ctctcagagg tatcagtcct gcacatacaa 


840 
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gggctaatga 


agtttaccaa gggactccca ggatttgaaa atttaaccac tgaggatcag 


900 


gctgcattac 


agaaggcgtc aaaaactgaa gtgatgttcc ttcatgtagc ccagctttat 


960 


ggtgggaaag 


actcaacctc tggaagtact atgagaccag caaagccctc agctgggaca 


1020 


ctagaggtgc 


ataatcctag cgctgatgaa agtgttcatt ctccggaaaa ctttctcaag 


1080 


gaaggctacc 


cttcggctcc tctaactgat attactaaag aatttattgc ctcactatct 


1140 


tacttctaca 


gaagaatgag tgaacttcat gtatcggata ctgaatatgc tctgcttacg 


1200 


gcgacaacag 


tgcttttctc agatcgtcca tgccttaaaa ataagcagca tatagaaaac 


1260 


ct&caagaac 


cagtcefegcfr acttttgttt aagttttcaa -aaatgtaeca tccagaagac - 


1320' 


ccacagcatt 


tcgcccacct catagggagg cttactgaac tgagaactct gagtcacagc 


1380 


cactctgaaa 


tccttcgcat gtggaaaaca aaggacccca ggttggtgat gttattctct 


1440 


gagaaatggg 


atctgcactc attttcctga 


1470 



<210> 18 

z2ii?""t^/0 ~ '~™~ — —•-•»••——■• — 

<212> DNA 

<213> Mus musculus 

<400> 18 

tcaggaaaat gagtgcagat.cccatttctc agagaataac atcaccaacc tggggtcctt 
tgttttccac atgcgaagga tttcagagtg gctgtgactc agagttctca gttcagtaag 
cctccctatg aggtgggcga aatgctgtgg gtcttctgga tggtacattt ttgaaaactt 
aaacaaaagt tgcaggactg gttcttgtag gttttctata tgctgcttat ttttaaggca 
tggacgatct gagaaaagca ctgttgtcgc cgtaagcaga gcatattcag tatccgatac 
atgaagttca ctcattcttc tgtagaagta agatagtgag gcaataaatt ctttagtaat 
atcagttaga ggagccgaag ggtagccttc cttgagaaag ttttccggag aatgaacact 
ttcatcagcg ctaggattat gcacctctag tgtcccagct gagggctttg ctggtctcat 
agtacttcca gaggttgagt ctttcccacc ataaagctgg gctacatgaa ggaacatcac 
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ttcagttttt gacgccttct gtaatgcagc ctgatcctca gtggttaaat tttcaaatcc 600 

tgggagtccc ttggtaaact tcattagccc ttgtatgtgc aggactgata cctctgagag 660 

tctcagaaaa cttagttcgg ggttggaacc ctcctgcaga agtttgcttg tttctcccaa 720 

gggaatratg gatttttggt gagcagtcac tatggtattc agaagccgat gttcctcttg 780 

agttagagtc atgttgtcct gaaccccttt cccagatctg gtgctggatg acacgtgttt 840 

ggtgtctgct ccttcatctt ccacttggat ggcagggtac agggcaggcc cgtgcttgaa 900 

gttcttgcga agtctctttg acttacactg gatctctgtg agcaaacatt ctgccaacat 960 

ccccaccgcc.*t^jeacttct -tcagtctgGa-xtcttggcat tttctgcgca tgtacatgtc -1020 

catttcacag tgaccaccgt tcttgcaaga atacactgca ttcttggtaa tgctacgccg 1080 

gaaaaagcct ttgcacccct cacaagtaag tgcattgtag tgatatcctg atgccttatc 1140 

accgcagacc atgcagagca tgtcatcttg acccttccag cgtaaagaac aagttgctct 1200 

gcacctttgt gccccaggga attcatcttc atcatcgtga acaaccacat aagtggactg 1260 
accagattct cgagcgtcga gtccacacca actgcHcc^ 

atgtgtgttg aaatgacttt gtaaagatgg ggactgtaac gctggaggaa actgagcggt 1380 

agaatattga caatagggtg gttcttggaa atctgtatca cacagctgat aatggaattg 1440 

ttctggcaaa atatctggtt ttattaacat 1470 

<210> 19 

<211> 489 

<212> PRT 

<213> Mus musculus 

<400> 19 

Met Leu He Lys Pro Asp He Leu Pro Glu Gin Phe His Tyr Gin Leu 
1 5 .10 15 

Cys Asp Thr Asp Phe Gin Glu Pro Pro Tyr Cys Gin Tyr Ser Thr Ala 
20 25 30 
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Gin Phe Pro Pro Ala Leu Gin Ser Pro Ser Leu Gin Ser His Phe Asn 
35 40 45 

Thr His Gly Leu Asp Pro Gin Tyr Ser Gly Gly Ser Trp Cys Gly Leu 
50 55 50 

Asp Ala Arg Glu Ser Gly Gin Ser Thr Tyr Val Val Val His Asp Asp 
65 70 75 

Glu Asp Glu Phe Pro Gly Ala Gin Arg Cys Arg Ala Thr Cys Ser Leu 



85 



90 



Arg Trp Lys Gly Gin Asp Asp Met Leu Cys Met Val Cys Gly Asp Lys 



100 



105 



-••Ala Ser Gly ryr. Hisdyr- Asn- Ala Leu Th.r Cys Glu Gly Cys -Lys Gly,;, 
115 120 I 25 

Phe Phe Arg Arg Ser He Thr Lys Asn Ala Val Tyr Ser Cys Lys Asn 
130 135 140 

Gly Gly His Cys Glu Met Asp Met Tyr Met Arg Arg Lys Cys Gin Glu 



145 



150 



155 



165 170 . 1/3 

Leu Thr Glu He Gin Cys Lys Ser Lys Arg Leu Arg Lys Asn Phe Lys 
180 I 85 iy 

His Gly Pro Ala Leu Tyr Pro Ala He Gin Val Glu Asp Glu Gly Ala 
195 200 205 

Asp Thr Lys His Val Ser Ser Ser Thr Arg Ser Gly Lys Gly Val Gin 



210 



215 



Asp Asn Met Thr Leu Thr Gin Glu Glu His Arg Leu Leu Asn Thr lie 



225 



230 



240 



Val Thr Ala His Gin Lys Ser Met He Pro Leu Gly Glu Thr Ser Lys 
245 250 25b 

Leu Leu Gin Glu Gly Ser Asn Pro Glu Leu Ser Phe Leu Arg Leu Ser 
260 265 270 

Glu Val Ser Val Leu His He Gin Gly Leu Met Lys Phe Thr Lys Gly 
275 280 285 
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Leu Pro Gly Phe Glu Asn Leu Thr Thr Glu Asp Gin Ala Ala Leu Gin 
290 295 300 

Lys Ala Ser Lys Thr Glu Val Met Phe Leu His Val Ala Gin Leu Tyr 
305 ' 310 315 320 

Gly Gly Lys Asp Ser Thr Ser Gly Ser Thr Met Arg Pro Ala Lys Pro 
325 330 335 

Ser Ala Gly Thr Leu Glu Val His Asn Pro Ser Ala Asp Glu Ser Val 
340 345 - 350 

His Ser Pro Glu Asn Phe Leu Lys Glu Gly Tyr Pro Ser Ala Pro Leu 
355 360 365 

Thr Asp "lie Thr Lys Glu Phe He Ala Ser Leu Ser Tyr Phe Tyr Arg 
370 ' 375 380 

Arg Met Ser Glu Leu His Val Ser Asp Thr Glu Tyr Ala Leu Leu Thr 
385 390 395 400 

Ala Thr Thr Val Leu Phe Ser Asp Arg Pro Cys Leu Lys Asn Lys Gin 
405 410 415 

His lie Glu Asn Leu Gin Glu Pro Val Leu Gin Leu Leu Phe Lys Phe 
420 425 430 

Ser Lys Met Tyr His Pro Glu Asp Pro Gin His Phe Ala His Leu He 
435 440 445 

Gly Arg Leu Thr Glu Leu Arg Thr Leu Ser His Ser His Ser Glu He 
450 455 460 

Leu Arg Met Trp Lys Thr Lys Asp Pro Arg Leu Val Met Leu Phe Ser 
465 470. 475 480 

Glu Lys Trp Asp Leu His Ser Phe Ser 
485 

<210> 20 
<211> 16200 
<212> DNA 
<213> Mus musculus 

ctttggggtc atagcattta ttaggtattg ttagttccat ttactgtgtt cctattgtgc 60 
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tggacggcaa gtcctgagat ggaatagatg tgttgaacta actttctgtt gagtataaaa 120 

cattttgagg tttaaaacaa ttatgaactt ttttgacctt ttatgatttg ttctttttta 180 

tgttaataaa accagatatt ttgccagaac aattccatta tcagctgtgt gatacagatt 240 

tccaagaacc accctattgt caatattcta ccgctcagtt tcctccagcg ttacagtccc 300 

catctttaca aagtcatttc aacacacatg gcttggatcc acagtacagt ggaggcagtt 360 

ggtgtggact cgacgctcga gaatctggtc agtccactta tgtggttgtt cacgatgatg 420 

aagatgaatt ccctggggca caaaggtgca gagcaacttg ttctttacgc tggaagggtc 480 
/aagatgacat -gct'otgcatg:gtGtgcggtg ataaggcatc aggatatcac tacaatgcac ; -540 

ttacttgtga ggggtgcaaa ggtaagggta aggcgcaacc aaacagacag tgctgcgcac 600 

gcgcacacgt gtgtatgtgt gtgtgcacgt gcgtgtgcaa tcgttataaa acatgagaaa 660 

caaaaccagg aaaaagtatg tctaaaggga tcaatctgtg tgatcaaaga taggagaata 720 

aaatgttaag tttaaggtta gtttctttcc tttcctttcc tttcctttcc tttcctttcc 780 

ctttctttct ttctttcttt ctttctttct ttctttcttt ctttctttct ctctctctct 900 

ctctctctct ctctctctct ctctctctct ctctctctct ctccctctct ctctctctct 960 

ctctctctct ctctctctct ctctctctct ctctctctct ctctctctct ctctctccct 1020 

ccctcccccc ctctctctct tttgaagtag agtatcattg tatagccctg gctggcctag 1080 

acctcactat ggagatcagg ttggctttga actcacagat acctgcctgt ctctgctttc 1140 

taggtgctgg gattaaaggt gtgggccacc atgcctggac catttctttc tttatttttt 1200 

tttctcctcc cttgtcttcc ccttcgtcct catcttctcc tcctcatctt tcttcttttt 1260 

ttgagaaaag gtctctctat gcatccctgg ctgtcctgga actcactgtt tagaccaggc 1320 

tgtccttgaa ctctcagaga tccacctacc tttgtatcct tactgctggc attaagagca 1380 

ttcactacca ggtctgtctc ttcatttttg atgaagactt tcacacttta ctttaaactc 1440 
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gatacatata agcaaacgtt ttccctattt ctagaagaat ttgatgctgg aacaaaacat 1500 

gacaatgtaa agatctgtaa accatgacac ctttttatgt ggttgataaa atttatacac 1560 

ttggagaaaa tttgaaataa atgacttcta tgattctaag tcaataaatt ttaaaaagtg 1620 

tttcgtatag aaagagaaca agctgaaagt cgggaagcaa gggccagtgt ttgcaatgat 1680 

atttgtaaag agcggaagcc tctctgataa ataacagttc tgaaacagtc taccactcca 1740 

gaaatggata aacatcggga tgggaatgca ggccagtggt agagcactta gcacctactt 1800 

ttggatttga ttctagcacc ctctgaaaca aaacaaaaca acccaaacaa cagaaaacag 1860 

tttagtcaat gtaacacaaa taaacgtgtt tgtttagggt aaaattttac ctgtaaatct 1920 

taaacctcaa cactggcaaa gaatagccaa aggatatgag gtgacagagt cctgtggcga 1980 

gtagttaggc aggggtggcc tgctcttgac ggtgacgggt tactgttaaa actgagctga 2040 

agaacccagc acagttttga aaacctgagc tgagcttttt cttccttcgt tgaactgaaa 2100 

tggcgaaact ccatctttca agtgtccact gcaagagagt aagtggtggg gctttgggac 2160 

lacT^TaW^ 

gaaaagggat gggaactatc tggcaagtgc tgggaaaggg agagagagca tctgcggatg 2280 

agggagggag ggaccacctt tgtgggctgc aactcagact gggagggttt agaaaagaga 2340 

ggcttttctg aattttttct ttattttagg tgaccatgta ttctctcaaa ggaaactcga 2400 

gagatgcatt gtaggatctt gatattttgt ataatgcagt taaaaaaaaa attcttatag 2460 

gtaatttacc tatagtaggg tagagagttg tcttaatgtg ataatgatac tccatttaat 2520 

ctccagacgt tataaagaaa gacccagcct caggatgtca actgaactat actaactata 2580 

aagaattata gaattagcta actttccaga taaatgtata tttaaaaaaa tgtcgggaat 2640 

aaaaaaaacc ctatttaatt gaaatactct agttagcttg attactttct ataaaggtaa 2700 

aatgttaata aaaatgagaa gtaactaaac atatccatgt caaactttga ccatctttgt 2760 

tgttaggctt tttccggcgt agcattacca agaatgcagt gtattcttgc aagaacggtg 2820 

gtcactgtga aatggacatg tacatgcgca gaaaatgcca agagtgcaga ctgaagaagt 2880 

Seite 17 



WO-02/22817. PCT/EP01/10323 



L-0020-01-WO-01.ST25 

gtaaggcggt ggggatgttg gcagaatgta agtgccagtg tgttcgtggc ttgcttttct 2940 

gaggggattt aacttttccc agccttacct tcctgacaag gaggacatga tagtggtggt 3000 

ggtttttcca gggcaagaat cgcccattgc actagggatg tgctgactga cccctgtgat 3060 

ttccacaaat gtgggaaact caggtgcatc atttgtgtta gtgaaggact gtattcacac 3120 

tctcctctat ataataaaaa tgaatttaaa aattaataaa aaattaaaaa tttccaacct 3180 

tcttgaggac ataaagcatt cttcaagagt tattttgaac atggccgtat tcgtttgtat 3240 

atgcacacca cacacatatg tacagtaatt gcgccacgtg gatggcccat tatcaattta 3300 

agtagtgatt cceataaatg ttgeGctgtt -ctatattgct ctaatxttga aatcagtcat 3360 \ 

agtttcatct taattagatg ttctctgaga cctgaagagt agttaggcag ttaagagcat 3420 

taggagagga cctgggcccg gtccccagct tccacgtggc ccctcacaat catttataac 3480 

tcagttcctg agcatcccag gccctcttct gacctctgca ggtatgagtg ttgtgtgcat 3540 

acacacatgc aagcaaaaca cacacataaa ataaaagaaa cctccatagt gatttaaaaa 3600 



aggtattgta tgtttctgtt acttaaccta aaaatatata ctgccttgaa aagtgcaaag 3660 

ttaaagttag tagttcccaa aaagaaagaa agaggaggag aaagaagaga aaaggaagtc 3720 

ttccctttct gagaaggggt tctcagagtc ctaacacact acagtcttag gacaattgaa 3780 

gggtaactgg atctctggac ccctgacatc ctcatatgta aagtccacta atagtctcac 3840 

gagtgtcagg actcatctct gaatatcaac acattagcat atgagcagtg ggagatacaa 3900 

ctcagtccaa agaaaccttt gaaaaagttg tcttttattt tattcattaa tttaatttgt 3960 

attcaggatc tcactatgta gctgacctgg aacttgctat gtacgccagg ctggcatcaa 4020 

actcagagat ccccctgcct ctgccctcca atttctggct caaaagttgt cttttaacct 4080 

gtggcaggca atgcatgtga aagggaagtt ggagtcctct gaggaactct atgcataaaa 4140 

tgagtcatcc tacacttaaa gaaacttaaa gaaattctgg caaggtgaag ctcgagagag 4200 

ttggtgagca gtcacggtga ttggcgagca taagctggtc cgcaccctgc caccattgct 4260 
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tggatctgca cacttcattt acctttcagt tgtcctaata ctgccatgtg ttagtgctct 4320 

gacggcccct cctcctcttc cttccctcat tctcctttcc ctttcctcct ttcttttcct 4380 

cttcctcctc ttccccctct tcttcctcct tttcctcctc ttcgttctct tcttgtggtt 4440 

ttccttctca tggtcctggg ggttgagttc aggtttctca catcctagac aatctaccac 4500 

taagctacac ccatagactc ctttcctact aaaaaacact gagttgccat ggctggtatt 4560 

caacgtgaga tcatcttgta acagtgtctt aagtagctgg ggttataggt gtttactacc 4620 

atgttcaata ctcaatatgc ttaaaatggc tttatttttt tgtgtattct gttctctgac 4680 

aaataataaa ataatataga ggggttttaa ggtttttgtt tatacttcag ctcctcatta 4740. 

gttttttatt ttataggtgt gtttttaaaa atactttgaa aacgacaggt atattaatag 4800 

ttggatactt ttagtagctg ctaaaagtac ttagagtagt ccagtccagt ccagtacata 4860 

cactacagct cacaaccagc tatagtctca cttcaagggg ctatgacccc tctgtgctct 4920 

gtgggcacct gcactcacct gtgcatttcc ccatacagac tcatgcaagt aattataaaa 4980 

tatl^^ 5040 



aaatgctttg 


tctgacatag 


gaaaaattga cttagcttat ttttaaaaat cctaagatag 


5100 


tcttttttaa 


ccttattaat 


aaaaccagtg ataatagtca tagataatgt aattctacta 


5160 


ttctatcaaa 


tgactttttc 


acttccaaga tgcaactaat gtgtaaatct cttatgtaaa 


5220 


aatctaaaca 


taatttggaa 


atgaaaaatt aattcttaat ctttgttata actattttta 


5280 


agcagacaag 


gtgtctatga 


agggaaaagc agaataggaa atttaataga tgcagacatg 


5340 


cacgagagat 


atctaggact 


atttactca.c tgaagaccat tttaatcaaa ccctacagtg 


5400 


ttgttttccg 


agccaaaagt 


acttaatgtg aattctgtaa tgtcacaggt ttgctcacag 


5460 


agatccagtg 


taagtcaaag 


agacttcgca agaacttcaa gcacgggcct gccctgtacc 


5520 


ctgccatcca 


agtggaagat 


gaaggagcag acaccaaaca cgtgtcatcc agcaccagat 


5580 


ctgggaaagg 


ggcaagacgc 


ttcttccata cagtctcttg tgtctttatg aacagtgtgt 


5640 


aacaagcaca 


cattacaggc 


gccctctgtt tccttatagg ttcaggacaa catgactcta 


5700 
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actcaaqacjq 


aacatcggct tctgaatacc 


atagtgactg ctcaccaaaa 


atccatgatt 


5760 


cccttaaaaa 


aaacaaqcaa acttqtatqt 


atccattgct ggcaagaaat 


aatttgtaat 


5820 


aaataatatt 

yuu i»yu bu b b 


aooaaatoat aacaaaaaca 


gtaaaaaaga aacaccttag 


taattatcag 


5880 


aaaaacatac 


tataaatrat ttttatttat 


ttattttttt tatqqaccaq 


ctqcaqqaqq 


5940 


crM~ rraarrr 


raaartaaat ttfctaaaac 


tctcaaaaot atcaatccto 

LfV>bv>uyuyy b u bv»uvj i*v*<w i»y 


cacatacaac 


6000 


nnr"t" a atrial 


ntttarraan nnartrrran 


ntaaaattrt atttrattat 

yuuuuQU-uuu ybbbb-ubbub 


tttttaaaaa 

bbbbbuuyyu 


6060 


LLLattLatlL 


ttta tg LLca tgtgttatgc 


atgtbttgai tgcaattaba 


Cat LCa tatC 


OJLxU 




ratattritaa ^Gdrcabaaa 

uuouuvy Lyy uyyu-v^uyauy 


aaadtattto GtcccGtaaa 

ciyuybybbby y i>bvu^ uwyu 


actaoaattt 


-6180 


rctarraata 

b>^* uuu^yu uy 


actataaatt accatataaa 


catfcaaaacc aaaotctctq 


caaqaqcaao 


6240 


araaartrtt 

uv-biy y \* bb* b b 


aartartaat rtatrtttrt 


aarcrccaat ttttaaaaaa 


aattatttat 


6300 


ttatttattt 

b l/U b L bU b b b 


atttatttat ttatttattt 

U b b bU b b bU b b bU b b b*4 b b b 


atttatctaa atacattcrta 

UbbbbJbb-byU yuiiV/llULiybU 


actqtacaaa 

yv^o^j L>uuuyu 


6360 


taattcrtaaa 

^•yy ^y ^y u 


ccttcatcrt0 ottattaaaa 


attaaatttt taoqacctct 


gctcactctg 


6420 


atcaacccca 


ctcattctaQ cccaaaoatt 


tcataaqtac actqtaqctq 


tcttcaqqtq 


6480 


caccagaaga 


qqqcQttaqa tctccrttacq 


qatqqttqcq aqccaccqtq 


tqqttqctqq 


6540 


gacttgaact 


caqqqctttt taqqaaqaqt 


agtcagtgct cttacccact 


gagctatctc 


6600 


tccagcccca 


aattttcttt tttttaaagg 


gatagtatat caaacagaaa 


tttctagttg 


6660 


tggtactaga 


tctatcgctc agaggcagag 


cacctgtcta atgtgtgtga 


agccctgggt 


6720 


tttctagcaa 


tgttgcttct aactaaaaaa 


aaaaaaaatg ttiaixxgxc 


tttgxcaatt 


D/oU 


aatcagatga 


gtgacaatct aaatatttac 


aaaaattatt ataatgatgt 


gggctttctt 


6840 


agatgaaaca 


agaaacaaag aaggtttcct 


ctataatact aaacaagtca 


cagtttgacc . 


6900 


aaaacttata 


tataactcag ctttattata 


tgcttagagg tacagaaact 


ctttacaaaa 


6960 


ttacattctt 


caaggctaat gcagctatgt 


tgatttgtcc ttgctgtaag 


aaagtgtaac 


7020 


ctagtgctga 


taaaagtgcc ccaaaataga 


aaaaaagaat tttattcaga 


tttcaattta 


7080 
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tatactattg ttattcccca gaagcactta gggttaataa gtaaatattt ttaaacattt 7140 

aatttatgaa cgttctcttt tgattatctg tttctttatg ctttatttat ttatttattt 7200. 

atttattttt aaggatttga aaatttaacc actgaggatc aggctgcatt acagaaggcg 7260 

tcaaaaactg aagtgatgtt ccttcatgta gcccagcttt atggtgggaa agactcaacc 7320 

tctggaagta aaaagaaaaa cagaatttta ttcaacttca agtacacaca gaeagacaga 7380 

cagacagaca gacacacaca cacacacaca cacacacaca cacacacacg cacgcacgca 7440 

cgcacacact tttaaacact gttttttgga aatagactct catgtagtcc aggctatgta 7500 

gaactcagta tgtagctaag gaggatctta aagccctgct gtgtctgctc ttacctgcca 7560 

accatcagcc tagagcctct atgcctgcct tctatggtgc tggggacaga agccagggtc 7620 

tgtgctcagt aagccagaac tcctaactcc taacatcatc agcctttcat ttatcttttc 7680 

attttatttt accagttagt gtaggaacaa atgtctcata gtagctgttt ataaatagtt 7740 

tcacattcat ctagcatttt ctctctaccg tgtaattctt ttaaaaattg tatttcttat 7800 

i£itgtgaga^actoka^ ctcacctcac 7860 

ttccttccca ccagctcctt tagaatcccc cacacacctc tctcctaact tcatgtcctt 7920 

ccctgctcct tctttttcaa aataacccat tgactctaaa tagtgctgct catatacacg 7980 

ggggtggggg tagggtcatc cactgaagca tggccaactt agagggctca gttccctaaa 8040 

gagactggat tgtccttccc ctagcagctg ccaacagctt ctcagctgtg ggtggggctt 8100 

ctgtgggtgg ggcttctgtg ctccttagac aggctcttcc aatgaggagg acttttatgg 8160 

gtgatcaaac ataaaggcgt tggtgattct aagacacaaa gcagctctgg ggacggccgc 8220 

atgcttgctt tgaaactcat aaccacgggc tactgtttgt ggaacgtctc tttactgggg 8280 

tgaagcccta ttaggttagt gttgaaccaa ctatgcagga cctcacttca ataaaaaaaa 8340 

agcaagtatt ttttaaaaaa tcaataaacc atttgaatag agcaggaaag ccatcattgt 8400 

ttagtgtgtg aatgcaagag tccaatctgt aaagtcgcat gttggttgag gacatgggaa 8460 

gaggcaatag gacaaaatca aaaccaaaag ccaacaatgc tagctgcctt cactagctgt 8520 
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actcgttaca gtgcacagcc ttttaaaata tccttattta ttgattgttc tctgcaaaca 8580 

ctagaccaga aggctctaaa gggaggccct tgatgacctg gtccatgatt gcttgagtag 8640 

tgtttgctta aaagtctcca cagcctctgg tgcactgagt ggttcgtgga gggtgtgtgt 8700 

gtttaagaaa gccattctta gtttctattt cattatgcaa aactcatttt ggagggtctt 8760 

ctgtacaatc tttttttttt ttttacaaaa tatgaactaa tatctgtttt gtgttccagt 8820 

aacaaaaatg agaatgatat cattttagga gttttttttt cccccacata atagttatgc 8880 

tggatactga aaagatactg aatttactat ttatataatg attaagaaag actatcaaca 8940 
tttgtcjtttt tctcttttgb taggtactat gagaccagca- aagcccteag* ctgggacaet . * 9000- 

agaggtgcat aatcctagcg ctgatgaaag tgttcattct ccggaaaact ttctcaagga 9060 

aggctaccct tcggctcctc taactggtaa cactgtgcag taggaagcat caccttgggt 9120 

ggacacgctt ccttttctcc ccttttcatt gcactacaaa gagaaggcca tgggctttct 9180 

cacagctcac aggcgtctct gtggtggtga aagcatttat agacacagcc aatacagttg 9240 
Trtigtgttg^ 

atcatggcaa ttctttccct atttaattgc gaagccattt ggggcctctg aattattctt 9360 

aggatgatga tgtaatctac agccaacatc ccagtatcaa ccacttttat gggtagaagg 9420 

gtaagcctta ctgtagataa tagagataca cggagagtgg cttctgtagg ctaaaggctc 9480 

tcagttccac agcctcctcc ttggtccacg tgatgacaac ctctctttgt tctggtcctt 9540 

ttcagtagca cacctaatgc gcccactctt gaggttcagg cctgggcttc aagggcacaa 9600 

caaaggggat tgtctgtgcc ggccatcaca tcggtagcca tcttgaaccc cactctcttt 9660 

tcaattttct tacactgttt gagagaagaa aagacaatca caatgtgttt gcttctgtgc 9720 

tccattccga tgaaaattca cttatattaa tcttttcaga tattactaaa gaatttattg 9780 

cctcactatc ttacttctac agaagaatga gtgaacttca tgtatcggat actgaatatg 9840 

ctctgcttac ggcgacaaca gtgcttttct caggtacaga ctgagaagta aggctcggag 9900 
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tgatctctgg 


acaggcaatc 


actaagttct 


ttttatcgaa 


ctatgaactg aagttgaatg 


9960 


taccactctc 


atcccatcat 


gctttgctct 


ctatgtgcat 


tcctttaagc ttaccactgg 


10020 


gttacatctt 


tcttgcatct 


taaagaaatg 


ctttcttgcc 


ctcatagccc ctcagatata 


10080 


cgctcccagt 


ccctgataga 


gagaggttct 


cagggagtta 


gctatattct agtttccacc 


10140 


ttactcatgc 


tcactcctga 


gttcaccaca 


gtcacatttc 


actccaccct ctccagtcca 


10200 


ctacaacctg 


tcaggatctt 


caggacatgc 


tatcactgac 


tcaactggtc aatgctctgt 


10260 


cttcatctta 


cttgccccac 


tggtgtattt 


tgtagagagt 


atcataagca tctgaaagaa 


10320 


aagaagggga 


tatgttgttt 


cctgtgagcc 


cttcagtgga 


attttgaagc acggtgaaga 


10380 


tgtctgatgt 


gttaggagaa 


tttctcatgc 


ttctttatgt 


cagttcacag tcttcttgac 


10440 


tatgcgttac 


aaaccggacc 


caggttgaag 


tgagacagtt 


acttagcatg ggtaacccat 


10500 


tgctcaagcc 


agcgacgtct 


tcctagggga 


ccattgtcct 


gtgtccttca ccgattcggc • 


10560 


catgtttgtg 


ttcttttata 


ttctaacttg 


ctgtgatatc 


agggttcgtg aatgttcact 


10620 




nnrfrcanxa 

55 ^> 




10680 


cttttattat 


gctttattcc 


aagaagtttt 


ggaagcaact 


gtgtgacaaa caggataaag 


10740 


gatgtagaag 


gctggaggga 


cagcagtgtt 


ggctatcatc 


tgaatctgaa tatttggaaa 


10800 


atcagctcag 


actgttacct 


agggacaggt 


cactctggtt 


ggaccataaa actcttcttt 


10860 


ctttcagttc 


agtgtaggga 


attaagctac 


aaattctgga 


tggtgcagta ccttagagcc 


10920 


ctggtgttct 


ggctgtgtga 


ggctttcccc 


atctctcagg 


agtgggactg ataacgcctg 


10980 


tggccatcat 


cctttatcct 


tcttggtggg 


tcttttaaaa 


caaggactgt gtcatgctct 


t 1 A A A 

H040 


ctacatttaa 


gaatttcaat 


ggggccacag 


gcgcccctag 


attataaagt tctcataaag 


moo 


atatgttcat 


cacatccctt 


cagctgatct 


ttgggtttgc 


agggctaata tttgattccc 


H160 


aacaatttac 


cttttcataa 


tcttgccatt 


tctacttact 


gtttagatgg aaaagagtaa 


11220 


gttctccttg 


tagactgcag 


tccacctaac 


aacttgcctc 


cagccttgtt aacagtacca 


H280 


caagcagaac 


taaaatcatg 


caaagaattg 


cagaagcatg 


gcagtctcag gaagtttcca 


H340 
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ctccaaaaaa 


accccaaaac aaaaacaaaa 


acaaaaacaa 


aaacaaaaaa ccccacccac 


11400 


tcactaactc 


acataggtgg tttgtttcct 


tttgttggac 


atttacaatt ctatctatct 


11460 


atctatctat 


ctatctatct atctatctat 


ctatctatct 


atctatctat ctatctatcc 


11520 


tactatctat 


ctatctatct atctatctat 


ctatctatct 


atctatctat ctatctatct 


11580 


atctacctac 


ctacctatct atctatctat 


ctatctatct 


atctatctat cctactatct 


11640 


atctatctat 


ctatctatct atctatctat 


ctatctatct 


atccatccac tgtgtgtgtg 


11700 


tacatgtcat 


gcttgtggag gtcagaggac 


aacctatggg 


aattgtttct ctctttccac 


11760 


" . ca tgtgtgtt 


ctggggttca ggtagtcagg 


ctgggtgata 


gcatcttccc tgctgggcta 


.11820, - 


tcccactggc 


agctttgttg tttcacggaa 


tctctggaac 


atcaccctct taatttctct 


11880 


gttcattggc 


tgttctttct gtgccttctt 


tattctgtca 


tcttcgtctc tcacaatgac 


11940 


aaatatttaa 


gttgtcctag gacccagttt 


tcttttaaat 


tattttatga ttttatgcac 


12000 


atgtttcgcc 


tgcatctatg caccatggtg 


tgcctcacgc 


tgctgagttc agaagatatc 


12060 


aggtcccttg 


ggacttgact tacagaggtt 


gtgacaaggc 


ccagttttct tctgcactgg 


T2I3ZO 


cttccttcct 


gattccatcc taactcttgg 


cattaaaggc 


ctgtgtggat tcgatactcc 


12180_ 


cagacgtatc 


tgtgtgctcc cccagaatgt 


attaaatagc 


ctcctcaagt tttccatgta 


12240 


accgtcaaga 


aggtgtatca aatttaaaat 


gcccaaagct 


ggactcttga cacctttata 


12300 


ccttagcttt 


tggtattcct cccccatctg 


gaccagacca 


tagcttcgtc ttccaaattg 


12360 


-Patcrt araar 


tttctttttt aatccttaat 


ttcttttctt 


tctcttatat tcatatcaca 


12420 


atgcacccgt 


tctgtttatt caaccctaag 


atatacccag 


cacttaaccg cttcctaccg 


12480 


tgtatagctt 


gggcagcgta ggtgaatcca 


cttttctaac 


tggattttct gactctatgc 


12540 


tttatagact 


ctctctgtat tagtacacag 


aggggttccc 


tggaaatata agtcagatga 


12600 


tggtatgcct 


ctggaactgt cgatggcttc 


ctagtttgca 


tagattaaaa ccaaaggtcc 


12660 


gggggggggg 


ggctggagag atggctcagt 


ggttaagagc 


agtgactgct cttccaaagg 


12720 
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aattcccagc aaccatatgg tggctcacaa ccatctgtaa 


tgagatctga 


12780 


ctccatcttc 


tggagtgtct gaagacagct acagtgtact cacatataat 


aaaaataaat 


12840 


aaatcttaaa 


aaaaaaaacc aaaggtcctt ctattggtct agtttgtctt 


gctccttttt 


12900 


catcctcgcc 


catccacatt gctttgacac aggtttccct tgatctcgca 


ggtttctgct 


12960 


acatgctctg 


ccatttttga tgcatctttt gccccctttc cttggatgcc 


cacatgatgc 


13020 


catcttacac 


tgttttgcca ttttatctac ttgttttttt cttttacctt 


tttgggtttg 


13080 


ggttaaatgt 


tgttgtatcc ctgattaatc tttacaaata gctaccaccc 


ccccccaaga 


13140 


gtcttattta 


tttctgatta ctttttttaa tggcacatga ttattttgtt 


tttgcttctt 


13200 


taaaaaaacc 


tttttcattc attgataaaa agcacttact ttccttgatg 


tgtatgtctc 


13260* 


ctttgtctaa 


atgtgcaccc cttgtgtgct gtgagtccac aagaaggtgt 


catatcccct 


13320 


ggaactggag 


ttacagatgg ccgtgagcca ccatgtgggt gtggaaatca 


aaccccagta 


13380 


cttgggaaga 


gcagccagtg ctcttagctg ttgataggtc tttccagact 


gtttttactt 


13440 


__„„.ctaaaaatat_qtataaatct ctccctctcc acttcttcca cattaagaat 


ataaggacaa 


13500 


taaagggaca 


caatcttgcc tgtgagtttc attgtggtat agttagcaag 


acgactggat 


13560 


gaatgaaggc 


tgcctaagcc ttcggaactt cctggttagc tcatttctta 


cgtaggaacc 


13620 


caagaagtct 


cacgctttac ctctgttgac tgtagagagg ttacttacta 


ctattttaaa 


13680 


gtttcaggca 


agcgtt.tatg gctgttatat ccccaacctt actcaccgat 


ccctttcttc 


13740 


ccatgtgact 


ttatggaaat tacttaactc cctacatttc agggagctaa 


tctgaaaaaa 


13800 


atgtgataaa 


aatagaatgt agctcaatag aatatagtga agattaagta 


aagtcatgaa 


13860 


caaaaagaat 


ttagttagaa aacctaatgt gtattagggt tctgcaagta 


cttatgattt 


13920 


tataactctg 


ttaataactc attgttttaa gtgcattcat tattcccacc 


actttgaatc 


13980 


attattaaga 


taaatatgat gtttttcaga tcgtccatgc cttaaaaata 


agcagcatat 


14040 


agaaaaccta 


caagaaccag tcctgcaact tttgtttaag ttttcaaaaa 


tgtaccatcc 


14100 


agaagaccca 


cagcatttcg cccacctcat agggaggctt actgaactga 


gaactctgag 


14160 
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tcacagccac tctgaaatcc ttcgcatgtg gaaaacaaag gaccccaggt tggtgatgtt 14220 

attctctgag aaatgggatc tgcactcatt ttcctgaaaa tttacagtgg tgagacttgc 14280 

cttaaaattt caaaatcaaa attggatgaa tgatttgaaa aaaatgtgat aactcaaaat 14340 

cattgagtta tcacacacca ccttaaaatg gagatccaaa aggctcatca gggtttgtgg 14400 

aactttcaac tcttagaaga tgaacactcc aggattcatc tcaagattgg atgcaaagga 14460 

aagaaaaatg gagccatgtg gtggcatagg ctctgtagtt gagagacaga gacaagagga 14520 

ccccatgttt aaggcagtcc tgggctatgt agtgagacct catctcagaa gaaaaaagag 14580 

aagaactgat ggccateatt atgtctccaa tcctcaccta aagaaactaa ttttttgttg. 14640 .. 

ttgttgttta gctgaaacac ataatgtgta aatgtgaaca caggtaggtt atggttagga 14700 

taaggcagtt tgtatgcttc actggttaag gacacatgga tggtgtagca gctaatgctt 14760 

agaacctctt cttcatgaat gattaacgag gaacaagcct tttcaaattg tatctgcttt 14820 

atatttttaa taagtatgtg ataattgtac agaatagtca atattgtaca gaatatttca 14880 
^tttgggaBlri±eatgtg^^ 

atgcccacac tgccaatgcc cttctcatcc ccctaacatt tcccttcctc tgccttgccc 15000 

taaaacacca gattcatacg tgcaagaaaa cattttttaa acaaatttta ttagatattt 15060 

tcttcattta caattcaaat gctatcccaa aagtccctta ttccctcccc ttatcctgct 15120 

ctccaaccca cccacttctg cttcctggcc ctggtattcc cctgtattgg ggtatataat 15180 

ctttgcaaga ccaagggcct ctcctcccaa cgatggtcca ctaggccatc tgcaagaaaa 15240 

catttgattc cagtctgagt ctgttttatt tcctttgagt atgactatct cgtcaatatc 15300 

tgcaaatgac ataatttcat tcctcttaag gatggaaagg actggattgt gtatgtatat 15360 

ctgtatatat tattttattt atccacctgc tgatgacacc tagtttgatt ctgtatcttg 15420 

gccactgtga atcgtgctat aataaaccta tgtaggtttt tagattttat gtggctcttg 15480 

attctttcag gtgtactccc agaagtggta tggtttatat ggtagctcta tttttagatt 15540 
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tgaagggaac tatgtagtga ttttcacagt gattaccttc ttctcaatag tgtataaaga 15600 

ttaatttgcc ttcaaaattc ttgccaattt ttgtttttcc ctttcctttt ctttttaatt 15660 

aaaaaagtgt gtgtgtgtgt gcgtgcatgc caaagaacat tgtggacatc ggaggacagc 15720 

tctgtggaat acattctctc cttccacctt tatgggtggt ctcctctgac cccctctgcc 15780 

atggcctgca tggcttggtt ctggaagcct gattggggtg atgagcgaat gaagaaatga 15840 

cagacatgaa gacacaagta cagaaaagct gagattgggt ggtgtctaag tttctttttt 15900 

attgctataa agagacacca ggaccttatg aaagaaagca ttaaattggg ggcttgctta 15960 

cagtcttaaa gattgcgtcc atgatgatca tggcaggtag catggcacca ggcatgcact 16020 ■ 

ggaacagtat ctgaagagct tacattctga tcggcaagtt ggaggcaggg ggtgggggga 16080 

gaagaaagtg gctgatgggg cctgggacag gatttggaaa tctcaaagcc catccctagt 16140 

gacacacctc ctccaacgag gccacaccca gtcctcctca aataggtcca ccaagtaggg 16200 



<210> 21 

""^H>™T62W 

<212> DNA 

<213> Mus musculus 



<400> 21 „ 

ccctacttgg tggacctatt tgaggaggac tgggtgtggc ctcgttggag gaggtgtgtc bU 

actagggatg ggctttgaga tttccaaatc ctgtcccagg ccccatcagc cactttcttc 120 

tccccccacc ccctgcctcc aacttgccga tcagaatgta agctcttcag atactgttcc 180 

agtgcatgcc tggtgccatg ctacctgcca tgatcatcat ggacgcaatc tttaagactg 240 

taagcaagcc cccaatttaa tgctttcttt cataaggtcc tggtgtctct ttatagcaat 300 

aaaaaagaaa cttagacacc acccaatctc agcttttctg tacttgtgtc ttcatgtctg 360 

tcatttcttc attcgctcat caccccaatc aggcttccag aaccaagcca tgcaggccat 420 

ggcagagggg gtcagaggag accacccata aaggtggaag gagagaatgt attccacaga 480 

gctgtcctcc gatgtccaca atgttctttg gcatgcacgc acacacacac acactttttt 540 
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aattaaaaag aaaaggaaag ggaaaaacaa aaattggcaa gaattttgaa ggcaaattaa 600 

tctttataca ctattgagaa gaaggtaatc actgtgaaaa tcactacata gttcccttca 660 

aatctaaaaa tagagctacc atataaacca taccacttct gggagtacac ctgaaagaat 720 

caagagccac ataaaatcta aaaacctaca taggtttatt atagcacgat tcacagtggc 780 

caagatacag aatcaaacta ggtgtcatca gcaggtggat aaataaaata atatatacag 840 

atatacatac acaatccagt cctttccatc cttaagagga atgaaattat gtcatttgca 900 

gatattgacg agatagtcat actcaaagga aataaaacag actcagactg gaatcaaatg 960 

ttttcttgca gatggcctag tggaccatcg ttgggaggag aggcccttgg tcttgcaaag 1020 

attatatacc ccaatacagg ggaataccag ggccaggaag cagaagtggg tgggttggag 1080 

agcaggataa ggggagggaa- taagggactt ttgggatagc atttgaattg taaatgaaga 1140 

aaatatctaa taaaatttgt ttaaaaaatg ttttcttgca cgtatgaatc tggtgtttta 1200 

gggcaaggca gaggaaggga aatgttaggg ggatgagaag ggcattggca gtgtgggcat 1260 

cacagggtgg gcggcaaata cgccayaaat- atctaatsta tacacatgaa attcccatat 1320 

tgaaatattc tgtacaatat tgactattct gtacaattat cacatactta ttaaaaatat. 1380 

aaagcagata caatttgaaa aggcttgttc ctcgttaatc attcatgaag aagaggttct 1440 

aagcattagc tgctacacca tccatgtgtc cttaaccagt gaagcataca aactgcctta 1500 

tcctaaccat aacctacctg tgttcacatt tacacattat gtgtttcagc taaacaacaa 1560 

caacaaaaaa ttagtttctt taggtgagga ttggagacat aatgatggcc atcagttctt 1620 

ctcttttttc ttctgagatg aggtctcact acatagccca ggactgcctt aaacatgggg 1680 

tcctcttgtc tctgtctctc aactacagag cctatgccac cacatggctc catttttctt 1740 

tcctttgcat ccaatcttga gatgaatcct ggagtgttca tcttctaaga gttgaaagtt 1800 

ccacaaaccc tgatgagcct tttggatctc cattttaagg tggtgtgtga taactcaatg 1860 

attttgagtt atcacatttt tttcaaatca ttcatccaat tttgattttg aaattttaag 1920 

gcaagtctca ccactgtaaa ttttcaggaa aatgagtgca gatcccattt ctcagagaat 1980 
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aacatcacca acctggggtc ctttgttttc cacatgcgaa ggatttcaga gtggctgtga 2040 

ctcagagttc tcagttcagt aagcctccct atgaggtggg cgaaatgctg tgggtcttct 2100 

ggatggtaca tttttgaaaa cttaaacaaa agttgcagga ctggttcttg taggttttct 2160 

atatgctgct tatttttaag gcatggacga tetgaaaaac atcatattta tcttaataat 2220 

gattcaaagt ggtgggaata atgaatgcac ttaaaacaat gagttattaa cagagttata 2280 

aaatcataag tacttgcaga accctaatac acattaggtt ttctaactaa attctttttg 2340 

ttcatgactt tacttaatct tcactatatt ctattgagct acattctatt tttatcacat 2400 

ttttttc'aga ttagctccct gaaatgtagg gagrtaagta atttccataa agtcacatgg 2460 

gaagaaaggg atcggtgagt aaggttgggg atataacagc cataaacgct tgcctgaaac 2520 

tttaaaatag tagtaagtaa cctctctaca gtcaacagag gtaaagcgtg agacttcttg 2580 

ggttcctacg taagaaatga gctaaccagg aagttccgaa ggcttaggca gccttcattc 2640 

atccagtcgt cttgctaact ataccacaat gaaactcaca ggcaagattg tgtcccttta 2700 

ttgtccttat attcttaatg tggaagaagt ggagagggag agacttacac acatcttcag 2760 

aagtaaaaac agtctggaaa gacctatcaa cagctaagag cactggctgc tcttcccaag 2820 

tactggggtt tgatttccac acccacatgg tggctcacgg ccatctgtaa ctccagttcc 2880 

aggggatatg acaccttctt gtggactcac agcacacaag gggtgcacat ttagacaaag 2940 

gagacataca catcaaggaa agtaagtgct ttttatcaat gaatgaaaaa ggttttttta 3000 

aagaagcaaa aacaaaataa tcatgtgcca ttaaaaaaag taatcagaaa taaataagac 3060 

tcttgggggg gggtggtagc tatttgtaaa gattaatcag ggatacaaca acatttaacc 3120 

caaacccaaa aaggtaaaag aaaaaaacaa gtagataaaa tggcaaaaca gtgtaagatg 3180 

gcatcatgtg ggcatccaag gaaagggggc aaaagatgca tcaaaaatgg cagagcatgt 3240 

agcagaaacc tgcgagatca agggaaacct gtgtcaaagc aatgtggatg ggcgaggatg 3300 

aaaaaggagc aagacaaact agaccaatag aaggaccttt ggtttttttt tttaagattt 3360 
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atttattttt attatatgtg agtacactgt agctgtcttc agacactcca gaagatggag 3420 

tcagatctca ttacagatgg ttgtgagcca ccatatggtt gctgggaatt gaactcagga 3480 

cctttggaag agcagtcact gctcttaacc actgagccat ctctccagcc cccccccccc 3540 

ggacctttgg ttttaatcta tgcaaactag gaagccatcg acagttccag aggcatacca 3600 

tcatctgact tatatttcca gggaacccct ctgtgtacta atacagagag agtctataaa 3660 

gcatagagtc agaaaatcca gttagaaaag tggattcacc tacgctgccc aagctataca 3720 

cggtaggaag cggttaagtg ctgggtatat cttagggttg aataaacaga acgggtgcat 3780 

tgtgatatga atataagaga aagaaaagaa actaaggatt aaaaaagaaa gttgcacatg 3840 

caatttggaa gacgaagcta tggtctggtc cagatggggg aggaatacca aaagctaagg 3900 

tataaaggtg tcaagagtcc agctttgggc attttaaatt tgatacacct tcttgacggt 3960 

tacatggaaa acttgaggag gctatttaat acattctggg ggagcacaca gatjcgtctg 4020 

ggagtatcga atccacacag gcctttaatg ccaagagtta ggatggaatc agg&aggaag 4080 

^gtgcsga^agaaaactgg-ge^^ 

gatatcttct gaactcagca gcgtgaggca caccatggtg catagatgca ggcgaaacat 4200 

gtgcataaaa tcataaaata atttaaaaga aaactgggtc ctaggacaac ttaaatattt 4260 

gtcattgtga gagacgaaga tgacagaata aagaaggcac agaaagaaca gccaatgaac 4320 

agagaaatta agagggtgat gttccagaga ttccgtgaaa caacaaagct gccagtggga 4380 

tagcccagca gggaagatgc tatcacccag cctgactacc tgaaccccag aacacacatg 4440 

gtggaaagag agaaacaatt cccataggtt gtcctctgac ctccacaagc atgacatgta 4500 

cacacacaca gtggatggat agatagatag atagatagat agatagatag atagatagat 4560 

agatagtagg atagatagat agatagatag atagatagat agataggtag gtaggtagat 4620 

agatagatag atagatagat agatagatag atagatagat agatagatag atagatagta 4680 

ggatagatag atagatagat agatagatag atagatagat agatagatag atagatagat 4740 

agatagatag aattgtaaat gtccaacaaa aggaaacaaa ccacctatgt gagttagtga 4800 
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gtgggtgggg ttttttgttt ttgtttttgt ttttgttttt gttttggggt ttttttggag 4860 

tggaaacttc ctgagactgc catgcttctg caattctttg catgatttta gttctgcttg 4920 

tggtactgtt aacaaggctg gaggcaagtt gttaggtgga ctgcagtcta caaggagaac 4980 

ttactctttt ccatctaaac agtaagtaga aatggcaaga ttatgaaaag gtaaattgtt 5040 

gggaatcaaa tattagccct gcaaacccaa agatcagctg aagggatgtg atgaacatat 5100 

ctttatgaga actttataat ctaggggcgc ctgtggcccc attgaaattc ttaaatgtag 5160 

agagcatgac acagtccttg ttttaaaaga cccaccaaga aggataaagg atgatggcca 5220 
caggegttat cagtcccact cctgagagat ggggaaagcc tcacacagec agaacaccag - 5280, 

ggctctaagg tactgcacca tccagaattt gtagcttaat tccctacact gaactgaaag 5340 

aaagaagagt tttatggtcc aaccagagtg acctgtccct aggtaacagt ctgagctgat 5400 

tttccaaata ttcagattca gatgatagcc aacactgctg tccctccagc cttctacatc 5460 

ctttatcctg tttgtcacac agttgcttcc aaaacttctt ggaataaagc ataataaaag 5520 



tggacaatat aatggagctg ctctggagcc tatgtgattc aacttcagat gcacatacct 5580 

agtgaacatt cacgaaccct gatatcacag caagttagaa tataaaagaa cacaaacatg 5640 

gccgaatcgg tgaaggacac aggacaatgg tcccctagga agacgtcgct ggcttgagca 5700 

atgggttacc catgctaagt aactgtctca cttcaacctg ggtccggttt gtaacgcata 5760 

gtcaagaaga ctgtgaactg acataaagaa gcatgagaaa ttctcctaac acatcagaca 5820 
' tcttcaccgt gcttcaaaat tccactgaag ggctcacagg aaacaacata tccccttctt . 5880 

ttctttcaga tgcttatgat actctctaca aaatacacca gtggggcaag taagatgaag 5940 

acagagcatt gaccagttga gtcagtgata gcatgtcctg aagatcctga caggttgtag 6000 

tggactggag agggtggagt gaaatgtgac tgtggtgaac tcaggagtga gcatgagtaa 6060 

ggtggaaact agaatatagc taactccctg agaacctctc tctatcaggg actgggagcg 6120 

tatatctgag gggctatgag ggcaagaaag catttcttta agatgcaaga aagatgtaac 6180 
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atgcacatag agagcaaagc atgatgggat 


gagagtggta 


6240 


cattcaactt 


cagttcatag 


ttcgataaaa 


agaacttagt gattgcctgt 


ccagagatca 


6300 


ctccgagcct 


tacttctcag 


tctataccta 


aaaaaaocac tQttcrtcQCC 


gtaagcagag 


6360 


catattcagrt 


atccgataca 


taaaattcac 


tcattcttct ataqaaqtaa 


gatagtgagg 


6420 


caataaattc 


tttagtaata 


trtaaaaaaa 


ttaatataaa taaattttca 


tcggaatgga 


6480 


gcacagaagc 


aaacacattg 


1"ria1*+oi"rtt 


ttrttrtrtc aaacaatota 


agaaaattga 


6540 


aaagagagtg 


gggttcaaga 


tggctaccga 


tgtgatggcc ggcacagaca 


atcccctttg 


6600 


ttgtgccctt 


gaagcccagg 


cctgaacctc 


aagagtgggc gcattaggtg 


tgctactgaa 


6660 


aaggaccaga 


acaaagagag 


gttgtcatca 


cgtggaccaa ggaggaggct 


gtggaactga 


6720 


gagcctttag 


cctacagaag 


ccactctccg 


tgtatctcta ttatctacag 


taaggcttac 


6780 


ccttctaccc 


ataaaagtgg 


ttgatactgg 


gatgttggct gtagattaca 


tcatcatcct 


6840 


aagaataatt 


cagaggcccc 


aaatggcttc 


gcaattaaat agggaaagaa 


ttgccatgat 


6900 



gaa'gccamg to — 6960 

caactgtatt ggctgtgtct ataaatgctt tcaccaccac agagacgcct gtgagctgtg 7020 

agaaagccca tggccttctc tttgtagtgc aatgaaaagg ggagaaaagg aagcgtgtcc 7080 

acccaaggtg atgcttccta ctgcacagtg ttaccagtta gaggagccga agggtagcct 7140 

tccttgagaa agttttccgg agaatgaaca ctttcatcag cgctaggatt atgcacctct 7200 

agtgtcccag ctgagggctt tgctggtctc atagtaccta gcaaaagaga aaaacacaaa 7260 

tgttgatagt ctttcttaat cattatataa atagtaaatt cagtatcttt tcagtatcca 7320 

gcataactat tatgtggggg aaaaaaaaac tcctaaaatg atatcattct catttttgtt 7380 

actggaacac aaaacagata ttagttcata ttttgtaaaa aaaaaaaaaa gattgtacag 7440 

aagaccctcc aaaatgagtt ttgcataatg aaatagaaac taagaatggc tttcttaaac 7500 

acacacaccc tccacgaacc actcagtgca ccagaggctg tggagacttt taagcaaaca 7560 

ctactcaagc aatcatggac caggtcatca agggcctccc tttagagcct tctggtctag 7620 
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tgtttgcaga gaacaatcaa taaataagga tattttaaaa ggctgtgcac tgtaacgagt 7680 

acagctagtg aaggcagcta gcattgttgg cttttggttt tgattttgtc ctattgcctc 7740 

ttcccatgtc ctcaaccaac atgcgacttt acagattgga ctcttgcatt cacacactaa 7800 

acaatgatgg ctttcctgct ctattcaaat ggtttattga ttttttaaaa aatacttgct 7860 

ttttttttat tgaagtgagg tcctgcatag ttggttcaac actaacctaa tagggcttca 7920 

ccccagtaaa gagacgttcc acaaacagta- gcccgtggtt atgagtttca aagcaagcat 7980 

gcggccgtcc ccagagctgc tttgtgtctt agaatcacca acgcctttat gtttgatcac 8040 
ccataaaagt cctcctcatt gga-agagcct gtctaaggag car.agaagcc.ccaccca.cag - -8-100, 

aagccccacc cacagctgag aagctgttgg cagctgctag gggaaggaca atccagtctc 8160 

tttagggaac tgagccctct aagttggcca tgcttcagtg gatgacccta cccccacccc 8220 

cgtgtatatg agcagcacta tttagagtca atgggttatt ttgaaaaaga.aggagcaggg 8280 

aaggacatga agttaggaga gaggtgtgtg ggggattcta aaggagctgg tgggaaggaa 8340 



gtgaggtgag taagatttaa aaaaaaaaac acttaggaca tgtatacagt tctcacaaga 8400 

ataagaaata caatttttaa aagaattaca cggtagagag aaaatgctag atgaatgtga 8460 

aactatttat aaacagctac tatgagacat ttgttcctac actaactggt aaaataaaat 8520 

gaaaagataa atgaaaggct gatgatgtta ggagttagga gttctggctt actgagcaca 8580 

gaccctggct tctgtcccca gcaccataga aggcaggcat agaggctcta ggctgatggt 8640 

tggcaggtaa gagcagacac agcagggctt taagatcctc cttagctaca tactgagttc 8700 

tacatagcct ggactacatg agagtctatt tccaaaaaac agtgtttaaa agtgtgtgcg 8760 

tgcgtgcgtg cgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtc tgtctgtctg 8820 

tctgtctgtc tgtgtgtact tgaagttgaa taaaattctg tttttctttt tacttccaga 8880 

ggttgagtct ttcccaccat aaagctgggc tacatgaagg aacatcactt cagtttttga 8940 

cgccttctgt aatgcagcct gatcctcagt ggttaaattt tcaaatcctt aaaaataaat 9000 
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aaataaataa ataaataaag cataaagaaa cagataatca aaagagaacg ttcataaatt 9060 

aaatgtttaa aaatatttac ttattaaccc taagtgcttc tggggaataa caatagtata 9120 

taaattgaaa tctgaataaa attctttttt tctattttgg ggcactttta tcagcactag 9180 

gttacacttt cttacagcaa ggacaaatca acatagctgc attagccttg aagaatgtaa 9240 

ttttgtaaag agtttctgta cctctaagca tataataaag ctgagttata tataagtttt 9300 

ggtcaaactg tgacttgttt agtattatag aggaaacctt ctttgtttct tgtttcatct 9360 

aagaaagccc acatcattat aataattttt gtaaatattt agattgtcac tcatctgatt 9420 

aattgacaaa gacaaataaa catttttttt ttttttagtt agaagcaaca ttgctagaaa 9480 

acccagggct tcacacacat tagacaggtg ctctgcctct gagcgataga tctagtacca 9540 

caactagaaa tttctgtttg atatactatc cctttaaaaa aaagaaaatt tggggctgga 9600 

gagatagctc agtgggtaag agcactgact actcttccta aaaagccctg agttcaagtc 9660 . 

ccagcaacca cacggtggct cgcaaccatc cgtaacgaga tctaacgccc tcttctggtg 9720 

cacctgaaga cagctacagt gtacttatga aatctttggg ccagaacgag cggggccgac 9780 

cagagtgagc agaggtccta aaaattcaat tctcaacaac cacatgaagg ctcacaacca .9840 

tctgtacagc tacaatgtac tcagataaat aaataaataa ataaataaat aaataaataa 9900 ■ 

ataaataatc ttttttaaaa attgggggct agaaagacag atcagcagtt aagagcctgt 9960 

cttgctcttg cagagactct ggttccaacg cccacatggt aattcacagc catcggtagg 10020 

aaactctagt tctaggggac caaacactct cttctggcct ccacaatatg ttgtgtacat 10080 

-gatatgaata tataattgca atcaaaacat gcataacaca taaacataaa aataaataaa 10140 

tccttaaaaa ataatgaaac agaattttac ctgggagtcc cttggtaaac ttcattagcc 10200 

cttgtatgtg caggactgat acctctgaga gtctcagaaa acttagttcg gggttggaac 10260 

cctcctgcag ctggtccata aaaaaaataa ataaataaaa atgattcata gtatgtcttt 10320 

ctgataatta ctaaggtgtt tcttttttac tgttcttgct atcatttcct aatatcattc 10380 

attacaaatt atttcttgcc agcaatggat acatacaagt ttgcttgttt ctcccaaggg 10440 
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aatcatggat ttttggtgag cagtcactat ggtattcaga agccgatgtt cctcttgagt 10500 

tagagtcatg ttgtcctgaa cctataagga aacagagggc gcctgtaatg tgtgcttgtt 10560 

acacactgtt cataaagaca caagagactg tatggaagaa gcgtcttgcc cctttcccag 10620 

atctggtgct ggatgacacg tgtttggtgt ctgctccttc atcttccact tggatggcag 10680 

ggtacagggc aggcccgtgc ttgaagttct tgcgaagtct ctttgactta cactggatct 10740 

ctgtgagcaa, acctgtgaca ttacagaatt cacattaagt acttttggct cggaaaacaa 10800 

cactgtaggg tttgattaaa atggtcttca gtgagtaaat agtcctagat atctctcgtg 10860 

::*■ ...catgtctgca tctattaaat ttcciattGt/.gcttttcect, tcatagacac cttgtctgct . 10,920- 

taaaaatagt tataacaaag attaagaatt aatttttcat ttccaaatta tgtttagatt 10980 

tttacataag agatttacac attagttgca tcttggaagt gaaaaagtca tttgatagaa 11040 

tagtagaatt acattatcta tgactattat cactggtttt attaataagg ttaaaaaaga 11100* 

ctatcttagg atttttaaaa ataagctaag tcaatttttc ctatgtcaga caaagcattt 11160 

aattcttttt caaaggcaaa taatatttat tttaggattg tttttgtttg tttttaaata 11220 

ttttataatt acttgcatga gtctgtatgg ggaaatgcac aggtgagtgc aggtgcccac 11280. 

agagcacaga ggggtcatag ccccttgaag tgagactata gctggttgtg agctgtagtg 11340 

tatgtactgg actggactgg actactctaa gtacttttag cagctactaa aagtatccaa. 11400 

ctattaatat acctgtcgtt ttcaaagtat ttttaaaaac acacctataa aataaaaaac 11460 

taatgaggag ctgaagtata aacaaaaacc ttaaaacccc tctatattat tttattattt 11520 

gtcagagaac agaatacaca aaaaaataaa gccattttaa gcatattgag tattgaacat 11580 

ggtagtaaac acctataacc ccagctactt aagacactgt tacaagatga tctcacgttg 11640 

aataccagcc atggcaactc agtgtttttt agtaggaaag gagtctatgg gtgtagctta 11700 

gtggtagatt gtctaggatg tgagaaacct gaactcaacc cccaggacca tgagaaggaa 11760 

aaccacaaga agagaacgaa gaggaggaaa aggaggaaga agagggggaa gaggaggaag 11820 
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aggaaaagaa aggaggaaag ggaaaggaga atgagggaag gaagaggagg aggggccgtc 11880 

agagcactaa cacatggcag tattaggaca actgaaaggt aaatgaagtg tgcagatcca 11940 

agcaatggtg gcagggtgcg gaccagctta tgctcgccaa tcaccgtgac tgctcaccaa 12000 

ctctctcgag cttcaccttg ccagaatttc tttaagtttc tttaagtgta ggatgactca 12060 

ttttatgcat agagttcctc agaggactcc aacttccctt tcacatgcat tgcctgccac 12120 

aggttaaaag acaacttttg agccagaaat tggagggcag aggcaggggg atctctgagt 12180 

ttgatgccag cctggcgtac atagcaagtt ccaggtcagc tacatagtga gatcctgaat 12240 

acaaattaaa ttaatgaata aaataaaaga caactttttc aaaggtttct ttggactgag 12300 

ttgtatctcc cactgctcat atgctaatgt gttgatattc agagatgagt cctgacactc 12360 

gtgagactat tagtggactt tacatatgag gatgtcaggg gtccagagat ccagttaccc 12420 

ttcaattgtc ctaagactgt agtgtgttag gactctgaga accccttctc agaaagggaa 12480 

gacttccttt tctcttcttt ctcctcctct ttctttcttt ttgggaacta ctaactttaa 12540 

ctttg cactt ttcaaggcag tatatatttt taggttaagt aacagaaaca tacaatacct 12600 



tttttaaatc actatggagg tttcttttat tttatgtgtg tgttttgctt gcatgtgtgt 12660 

atgcacacaa cactcatacc tgcagaggtc agaagagggc ctgggatgct caggaactga 12720 

gttataaatg attgtgaggg gccacgtgga agctggggac cgggcccagg tcctctccta 12780 

atgctcttaa ctgcctaact actcttcagg tctcagagaa catctaatta agatgaaact 12840 

atgactgatt tcaaaattag agcaatatag aacagggcaa catttatggg aatcactact 12900 

"taaattgata atgggccatc cacgtggcgc aattactgta catatgtgtg tggtgtgcat 12960 

atacaaacga atacggccat gttcaaaata actcttgaag aatgctttat gtcctcaaga 13020 

aggttggaaa tttttaattt tttattaatt tttaaattca tttttattat atagaggaga 13080 

gtgtgaatac agtccttcac taacacaaat gatgcacctg agtttcccac atttgtggaa 13140 

atcacagggg tcagtcagca catccctagt gcaatgggcg attcttgccc tggaaaaacc 13200 

accaccacta tcatgtcctc cttgtcagga aggtaaggct gggaaaagtt aaatcccctc 13260 
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agaaaagcaa gccacgaaca 


cactggcact 


tacattctgc 


caacatcccc 


accgccttac 


13320 


acttcttcag tctgcactct 


tggcattttc 


tgcgcatgta 


catgtccatt 


tcacagtgac 


13380 


caccgttctt gcaagaatac 


actgcattct 


tggtaatgct 


acgccggaaa 


aagcctaaca 


13440 


acaaagatgg tcaaagtttg 


acatggatat 


gtttagttac 


ttctcatttt 


tattaacatt 


13500 


ttacctttat agaaagtaat 


caagctaact 


agagtatttc 


aattaaatag 


ggtttttttt 


13560 


attcccgaca tttttttaaa 


tatacattta 


tctggaaagt 


tagctaattc 


tataattctt 


13620 


tatagttagt atagttcagt 


tgacatcctg 


aggctgggtc 


tttctttata 


acgtctggag 


13680 


attaaatgga .\gtatcattat 


cacattaaga 


caactctcta. 


coctactata 


ggtaaattac 


13740. 


ctataagaat ttttttttta 


actgcattat 


acaaaatatc 


aagatcctac 


aatgcatctc 


13800 


tcgagtttcc tttgagagaa 


tacatggtca 


cctaaaataa 


agaaaaaatt 


cagaaaagcc 


13860 


tctcttttct aaaccctccc 


agtctgagtt 


gcagcccaca 


aaggtggtcc 


ctccctccct 


13920 


catccgcaga tgctctctct 


ccctttccca 


gcacttgcca 


gatagttccc 


atcccttttc 


13980 



ctc^ 

gtcccaaagc cccaccactt actctcttgc agtggacact tgaaagatgg agtttcgcca 14100 

tttcagttca acgaaggaag aaaaagctca gctcaggttt tcaaaactgt gctgggttct 14160 

tcagctcagt tttaacagta acccgtcacc gtcaagagca ggccacccct gcctaactac 14220 

tcgccacagg actctgtcac ctcatatcct ttggctattc tttgccagtg ttgaggttta 14280 

agatttacag gtaaaatttt accctaaaca aacacgttta tttgtgttac attgactaaa 14340 

ctgttttctg ttgtttgggt tgttttgttt tgtttcagag ggtgctagaa tcaaatccaa 14400 

aagtaggtgc taagtgctct accactggcc tgcattccca tcccgatgtt tatccatttc 14460 

tggagtggta gactgtttca gaactgttat ttatcagaga ggcttccgct ctttacaaat 14520 

atcattgcaa acactggccc ttgcttcccg actttcagct tgttctcttt ctatacgaaa 14580 

cactttttaa aatttattga cttagaatca tagaagtcat ttatttcaaa ttttctccaa 14640 
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gtgtataaat 


tttatcaacc acataaaaag gtgtcatggt ttacagatct ttacattgtc 


14700 


atgttttgtt 


ccagcatcaa attcttctag aaatagggaa aacgtttgct tatatgtatc 


14760 


gagtttaaag 


taaagtgtga aagtcttcat caaaaatgaa gagacagacc tggtagtgaa 


14820 


tgctcttaat 


gccagcagta aggatacaaa ggtaggtgga tctctgagag ttcaaggaca 


14880 


gcctggtcta 


aacagtgagt tccaggacag ccagggatgc atagagagac cttttctcaa 


14940 


aaaaagaaga 


aagatgagga ggagaagatg aggacgaagg ggaagacaag ggaggagaaa 


15000 


aaaaaataaa 


gaaagaaatg gtccaggcat ggtggcccac acctttaatc ccagcaccta 


15060 


gaaagcagag 


acaggcaggt atctgtgagt tcaaagccaa cctgatctcc atagtgaggt 

-•**;• --—■>---- - • - ' - V • - — «- «. • 


15120 


ctaggccagc 


-. . . * "• — - - - - *-- • - .•.*."■ . - x. , 

cagggctata caatgatact ctacttcaaa agagagagag ggggggaggg 


15180 


; v agggagagag 


agagagagag agagagagag agagagagag agagagagag agagagagag 


15240 


agagagagag 


agagagggag agagagagag agagagagag agagagagag agagagagag 


15300 


agagagagag 


agaaagaaag aaagaaagaa agaaagaaag aaagaaagaa agaaagaaag 


15360 


aaagaaagaa 


agaaagaaag aggaagggaa agggaaaggg aaagggaaag ggaaaggaaa 


15420 


~gga-aaggaaa-ggaaaggaa 



tattctccta tctttgatca cacagattga tccctttaga catacttttt cctggttttg 15540 

tttctcatgt tttataacga ttgcacacgc acgtgcacac acacatacac acgtgtgcgc 15600 

gtgcgcagca ctgtctgttt ggttgcgcct tacccttacc tttgcacccc tcacaagtaa 15660 

gtgcattgta gtgatatcct gatgccttat caccgcagac catgcagagc atgtcatctt 15720 

gacccttcca gcgtaaagaa caagttgctc tgcacctttg tgccccaggg aattcatctt 15780 

catcatcgtg aacaaccaca taagtggact gaccagattc tcgagcgtcg agtccacacc 15840 

aactgcctcc actgtactgt ggatccaagc catgtgtgtt gaaatgactt tgtaaagatg 15900 

gggactgtaa cgctggagga aactgagcgg tagaatattg acaatagggt ggttcttgga 15960 

aatctgtatc acacagctga taatggaatt gttctggcaa aatatctggt tttattaaca 16020 

taaaaaagaa caaatcataa aaggtcaaaa aagttcataa ttgttttaaa cctcaaaatg 16080 
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ttttatactc aacagaaagt tagttcaaca catctattcc atctcaggac ttgccgtpca 16140 

gcacaatagg aacacagtaa atggaactaa caatacctaa taaatgctat gaccccaaag 16200 

<210> 22 
<211> 1428 
<212> DNA 
<213> Mus musculus 

<400> 22. 

atgttaataa aaccagatat tttgccagaa caattccatt atcagctgtg tgatacagat 60 

ttccaagaac caccctattg tcaatattct accgctcagt ttcctccagc gttacagtcc 120 

ccatctttac aaagtcattt caacacacat ggcttggatc cacagtacag tggaggcagt 180 

tggtgtggac tcgacgctcg agaatctggt cagtccactt atgtggttgt tcacgatgat 240 

gaagatgaat tccctggggc acaaaggtgc agagcaactt gttctttacg ctggaagggt 300 

caagatgaca tgctctgcat ggtctgcggt gataaggcat caggatatca ctacaatgca 360 

-et-t-aettgtg-aggggt^^ 420- 

tcttgcaaga acggtggtca ctgtgaaatg gacatgtaca tgcgcagaaa atgccaagag 480 

tgcagactga agaagtgtaa ggcggtgggg atgttggcag aatgtttgct cacagagatc 540 ■ 

cagtgtaagt caaagagact tcgcaagaac ttcaagcacg ggcctgccct gtaccctgcc . 600 

atccaagtgg aagatgaagg agcagacacc aaacacgtgt catccagcac cagatctggg 660 

aaaggggttc aggacaacat gactctaact caagaggaac atcggcttct gaataccata 720 

gtgactgctc accaaaaatc catgattccc ttgggagaaa caagcaaact tctgcaggag 780 

ggttccaacc ccgaactaag ttttctgaga ctctcagagg tatcagtcct gcacatacaa 840 

gggctaatga agtttaccaa gggactccca ggatttgaaa atttaaccac tgaggatcag 900 

gctgcattac agaaggcgtc aaaaactgaa gtgatgttcc ttca'tgtagc ccagctttat 960 

ggtgggaaag actcaacctc tggaagtact atgagaccag caaagccctc agctgggaca 1020 

ctagaggtgc ataatcctag cgctgatgaa agtgttcatt ctccggaaaa ctttctcaag 1080 
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gaaggctacc cttcggctcc tctaactgga agaatgagtg aacttcatgt atcggatact 1140 

gaatatgctc tgcttacggc gacaacagtg cttttctcag atcgtccatg ccttaaaaat 1200 

aagcagcata tagaaaacct acaagaacca gtcctgcaac ttttgtttaa gttttcaaaa 1260 

atgtaccatc cagaagaccc acagcatttc gcccacctca tagggaggct tactgaactg 1320 

agaactctga gtcacagcca ctctgaaatc cttcgcatgt ggaaaacaaa ggaccccagg 1380 

ttggtgatgt tattctctga gaaatgggat ctgcactcat tttcctga 1428 

<210> 23 

■,:<m>: -.1428 \. ..." ' . •. • v" ' "v ' ' • ; 

<212> DNA 
<213> Mus musculus 

<400> 23 

tcaggaaaat gagtgcagat cccatttctc agagaataac atcaccaacc tggggtcctt 60 

tgttttccac atgcgaagga tttcagagtg gctgtgactc agagttctca gttcagtaag 120: 

cctccctatg aggtgggcga aatgctgtgg gtcttctgga tggtacattt ttgaaaactt 180 



aaacaaaagt tgcaggactg gttcttgtag gttttctata tgctgcttat ttttaaggca 240 

tggacgatct gagaaaagca ctgttgtcgc cgtaagcaga gcatattcag tatccgatac 300 

atgaagttca ctcattcttc cagttagagg agccgaaggg tagccttcct tgagaaagtt 360 

ttccggagaa tgaacacttt catcagcgct aggattatgc acctctagtg tcccagctga 420 

gggctttgct ggtctcatag tacttccaga ggttgagtct ttcccaccat aaagctgggc 480 

"tacatgaagg aacatcactt cagtttttga cgccttctgt aatgcagcct gatcctcagt 540 

ggttaaattt tcaaatcctg ggagtccctt ggtaaacttc attagccctt gtatgtgcag 600 

gactgatacc tctgagagtc tcagaaaact tagttcgggg ttggaaccct cctgcagaag 660 

tttgcttgtt tctcccaagg gaatcatgga tttttggtga gcagtcacta tggtattcag 720 

aagccgatgt tcctcttgag ttagagtcat gttgtcctga acccctttcc cagatctggt 780 

gctggatgac acgtgtttgg tgtctgctcc ttcatcttcc acttggatgg cagggtacag 840 
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His Ser Pro Glu Asn Phe Leu Lys GTu Gly Tyr Pro Ser Ala Pro Leu 
355 360 365 

Thr Gly Arg Met Ser Glu Leu His Val Ser Asp Thr Glu Tyr Ala Leu 
370 375 380 

Leu Thr Ala Thr Thr Val Leu Phe Ser Asp Arg Pro Cys Leu Lys Asn 
385 390 395 400 

Lys Gin His lie Glu Asn Leu Gin Glu Pro Val Leu Gin Leu Leu Phe 
405 410 415 

Lys Phe Ser Lys Met Tyr His Pro Glu Asp Pro Gin His Phe Ala His 
420 425 430 

"Leu'iVe Gly Arg Leu Thr Glu Leu Arg Thr Leu Ser His Ser His Ser 
435 440 445 

Glu He Leu Arg Met Trp Lys Thr Lys Asp Pro Arg Leu Val Met Leu 
450 455 . 460 

Phe Ser Glu Lys Trp Asp Leu His Ser Phe Ser 
465 470 475 
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ggcaggcccg tgcttgaagt tcttgcgaag tctctttgac ttacactgga tctctgtgag 900 

caaacattct gccaacatcc ccaccgcctt acacttcttc agtctgcact cttggcattt 960 

tctgcgcatg tacatgtcca tttcacagtg accaccgttc ttgcaagaat acactgcatt 1020 

cttggtaatg ctacgccgga aaaagccttt gcacccctca caagtaagtg cattgtagtg 1080 

atatcctgat gccttatcac cgcagaccat gcagagcatg tcatcttgac ccttccagcg 1140 

taaagaacaa gttgctctgc acctttgtgc cccagggaat tcatcttcat catcgtgaac 1200 

aaccacataa gtggactgac cagattctcg agcgtcgagt ccacaccaac tgectccact 1260 

.gtactgtgga..tceaagccat;stgtgttgaa atgactttgt aaagatgggg actgtaacgc- . - : 1320.: 

tggaggaaac tgagcggtag aatattgaca atagggtggt tcttggaaat ctgtatcaca 1380 

cagctgataa tggaattgtt ctggcaaaat atctggtttt attaacat 1428 



<210> 24 
<211> 475 

-^212>_BRX- 



<213> Mus musculus 
<400> 24 

Met Leu lie Lys Pro Asp He Leu Pro Glu Gin Phe His Tyr Gin Leu 
1 5 10 15 

Cys Asp Thr Asp Phe Gin Glu Pro Pro Tyr Cys Gin Tyr Ser Thr Ala 
20 25 30 

Gin Phe Pro Pro Ala Leu Gin Ser Pro Ser Leu Gin Ser His Phe Asn 
35 40 45 

Thr His Gly Leu Asp Pro Gin Tyr Ser Gly Gly Ser Trp Cys Gly Leu 
50 55 60 

Asp Ala Arg Glu Ser Gly Gin Ser Thr Tyr Val Val Val His Asp Asp 
65 70 75 - 80 

Glu Asp Glu Phe Pro Gly Ala Gin Arg Cys Arg Ala Thr Cys Ser Leu 
85 90 95 
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Arg Trp Lys Gly Gin Asp Asp Met Leu Cys Met Val Cys Gly Asp Lys 
100 105 110 

Ala Ser Gly Tyr His Tyr Asn Ala Leu Thr Cys Glu Gly Cys Lys Gly 
115 120 125 

Phe Phe Arg Arg Ser He Thr Lys Asn Ala Val Tyr Ser Cys Lys Asn 
130 135 140 

Gly Gly His Cys Glu Met Asp Met Tyr Met Arg Arg Lys Cys Gin Glu 
145 150 155 160 

Cys Arg Leu Lys Lys Cys Lys Ala Val Gly Met Leu Ala Glu Cys Leu 
165 170 175 

Leu.ThrGJu Ilr-Gln. Cys/tys.5er-..J.ysArgLeu Arg:.Lys.Asn Phe "Lys .- nrr^r ■; -.. 
180 185 190 

His Gly Pro Ala Leu Tyr Pro Ala He Gin Val Glu Asp Glu Gly Ala 
195 200 205 

Asp Thr Lys His Val Ser Ser Ser Thr Arg Ser Gly Lys Gly Val Gin 
210 215 220 

-Asp-Asn-Met-T*!U^u4h£j^ ' — 

225 230 235 240 

Val Thr Ala His Gin Lys Ser Met He Pro Leu Gly Glu Thr Ser Lys 
245 250 255 

Leu Leu Gin Glu Gly Ser Asn Pro Glu Leu Ser Phe Leu Arg Leu Ser 
260 265 270 

Glu Val Ser Val Leu His He Gin Gly Leu Met Lys Phe Thr Lys Gly 
275 280 -285 

Leu Pro Gly Phe Glu Asn Leu Thr Thr Glu Asp Gin Ala Ala Leu Gin 
290 295 300 

Lys Ala Ser Lys Thr Glu Val Met Phe Leu His Val Ala Gin Leu Tyr 
305 310 315 320 

Gly Gly Lys Asp Ser Thr Ser Gly Ser Thr Met Arg Pro Ala Lys Pro 
325 330 335 

Ser Ala Gly Thr Leu Glu Val His Asn Pro Ser Ala Asp Glu Ser Val 
340 345 350 
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